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ABSTRACT 


A  Galerkin-based  family  of  numerical  formulations  is  presented  for  solving  nonlinear  op¬ 
timal  control  problems.  This  dissertation  introduces  a  family  of  direct  methods  that  cal¬ 
culate  optimal  trajectories  by  discretizing  the  system  dynamics  using  Galerkin  numerical 
techniques  and  approximate  the  cost  function  with  Gaussian  quadrature.  In  this  numer¬ 
ical  approach,  the  analysis  is  based  on  /A-norms.  An  important  result  in  the  theoretical 
foundation  is  that  the  feasibility  and  consistency  theorems  are  proved  for  problems  with 
continuous  and/or  piecewise  continuous  controls.  Galerkin  methods  may  be  formulated  in 
a  number  of  ways  that  allow  for  efficiency  and/or  improved  accuracy  while  solving  a  wide 
range  of  optimal  control  problems  with  a  variety  of  state  and  control  constraints.  Numerical 
formulations  using  Lagrangian  and  Legendre  test  functions  are  derived.  One  formulation 
allows  for  a  weak  enforcement  of  boundary  conditions,  which  imposes  end  conditions  only 
up  to  the  accuracy  of  the  numerical  approximation  itself.  Additionally,  the  multi-scale 
formulation  can  reduce  the  dimension  of  multi-scale  optimal  control  problems,  those  in 
which  the  states  and  controls  evolve  on  different  timescales.  Finally,  numerical  examples 
are  shown  to  demonstrate  the  versatile  nature  of  Galerkin  optimal  control. 
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CHAPTER  1: 
INTRODUCTION 


The  last  two  decades  have  proven  to  be  a  time  of  active  research  for  numerical  meth¬ 
ods  for  optimal  control.  Particularly,  direct  collocation  methods,  such  as  pseudospectral 
(PS)  methods,  have  received  much  attention  [1-8].  PS  methods  produce  accurate  solutions 
on  a  wide  variety  of  optimal  control  problems.  Two  recent  highlights  are  the  success¬ 
ful  use  of  the  Legendre  PS  method  for  the  first  ever  zero-propellant  attitude  maneuver  of 
the  International  Space  Station  [4]  and  the  first  ever  minimum-time  rotational  maneuver 
performed  in  orbit  by  a  NASA  space  telescope  called  TRACE  [8].  In  the  Legendre  PS 
method  [1,  2,  5,  6,  9,  10],  the  problem  is  discretized  at  the  Legendre-Gauss-Lobatto  (LGL) 
points,  Legendre-Gauss-Radau  (LGR)  points  or  Legendre-Gauss  (LG)  points.  The  states 
are  approximated  with  globally  interpolating  Lagrange  polynomials  and  the  cost  function 
is  typically  approximated  using  Gaussian  quadrature  rule.  Other  variants  of  the  PS  method 
include  the  Chebyshev  PS  method  [11],  the  PS  knotting  method  [12]  and  the  Bellman 
method  [13].  The  Legendre  PS  method  will  be  outlined  in  Chapter  3  of  this  dissertation, 
preceded  by  a  review  of  mathematical  topics  in  Chapter  2. 

While  PS  methods  have  shown  to  be  good  all-round  methods  for  solving  nonlinear 
optimal  control  problems,  approximating  the  derivative  of  a  function  using  a  standard  PS 
differentiation  matrix  (such  as  the  Legendre  PS  differentiation  matrix)  may  introduce  errors 
into  the  approximation.  Chapter  3  highlights  this  issue  with  the  use  of  Jackson’s  Theorem. 
Additionally,  Chapter  3  will  motivate  the  use  of  the  weak  integral  formulation  to  approx¬ 
imate  the  system’s  dynamics.  This  leads  to  the  creation  of  a  family  of  Galerkin-based 
formulations  called,  “Galerkin  optimal  control.” 

The  family  of  methods  proposed  in  this  dissertation  are  derived  from  Galerkin  nu¬ 
merical  techniques  that  have  been  developed  for  numerical  solutions  to  differential  equa¬ 
tions  since  the  early  1970s  [14-16].  In  addition  to  the  family  of  Galerkin  optimal  control 
formulations  that  are  presented,  this  dissertation  highlights  important  theorems  that  prove 
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method  feasibility  and  consistency  for  problems  with  continuous  and/or  piecewise  contin¬ 
uous  controls. 

The  base  Galerkin  optimal  control  method  is  outlined  in  Chapter  4,  where  fea¬ 
sibility  and  convergence  theorems  are  presented.  Chapter  5  presents  a  review  of  addi¬ 
tional  Galerkin-based  formulations  and  strategies  such  as  the  use  element-based  Galerkin 
techniques  and  a  multi-scale  approach.  Lastly,  modifications  to  the  method  such  as  over¬ 
integration  and  the  use  of  various  quadrature  rules  are  offered  to  improve  computational 
efficiency  and/or  increase  accuracy  of  the  solutions. 

The  remainder  of  the  dissertation  is  organized  as  follows:  Chapter  6  presents  a 
Petrov-Galerkin  optimal  control  approach  to  discretizing  the  system  dynamics;  in  place  of 
Lagrange  polynomial  test  functions  integrated  into  the  base  formulation,  a  set  of  Legen¬ 
dre  polynomials  are  used.  Improved  feasibility  and  convergence  theorems  are  presented. 
Chapter  7  demonstrates  the  versatile  nature  of  the  Galerkin  optimal  control  formulations  by 
considering  a  number  of  example  problems.  Lastly,  Chapter  8,  highlights  the  potential  for 
Galerkin  optimal  control  in  solving  a  wide  range  of  real-world  optimal  control  problems 
with  a  variety  of  state  and  control  constraints.  Additionally,  Chapter  8  discusses  areas  of 
future  research. 
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CHAPTER  2: 

MATHEMATICAL  BACKGROUND 


2.1.  Optimal  Control 

Optimal  control  has  a  rich  history  that  dates  back  to  1696,  when  Johann  Bernoulli 
posed  the  bachristochrone  problem  in  the  Acta  Eruditorum  to  [17,  18]  “the  most  astute 
mathematicians  of  the  world.”  The  bachristochrone  problem  was  the  following: 

If  in  the  vertical  plane  two  points  A  and  B  are  given,  then  it  is  required  to 
specify  the  orbit  AMB  of  the  movable  point  M,  along  which  it,  starting  from  A, 
and  under  the  influence  of  its  own  weight,  arrives  at  B  in  the  shortest  possible 
time.  [19] 

In  addition  to  Johann  Bernoulli,  other  mathematical  giants  living  in  Europe  at  this  time, 
such  as  Newton,  Leibniz  and  Johann’s  brother,  Jacob  Bernoulli  [19]  (all  considered  “Men 
of  Mathematics”  by  Bell  [20]),  solved  the  bachristochrone  problem.  Later,  Euler  invented  a 
method  for  solving  such  problems  (with  mathematical  underpinnings  created  by  Lagrange), 
known  today  as  the  foundations  of  the  calculus  of  variations.  The  standard  calculus  of 
variations  problems  is  of  the  form  [21] 

minimize  J[y(-)}  =  [  F(t,y(t),y(t))dt,  (2.1) 

■I  to 

subject  to  y(t0)  =  y0  and  y(tf)  =  yf,  (2.2) 

where  J  acts  on  a  set  of  functions  and  is  called  a.  functional.  Notice  that  problem  (2. 1)— (2.2) 
may  be  written  in  the  equivalent  optimal  control  problem  form 

ftf 

minimize  u(-)]  =  /  F(x(t),u(t))dt ,  (2.3) 

J  to 

subject  to  x(t0)  =  [f0, 2/of,  x(tf)  =  [tf,  yf]T  and  x(t)  =  [1  ,u(t)}T,  t  €  [t0,tf\,  (2.4) 
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by  renaming  the  variables  t  and  y  as  t  —  yi  and  y  =  y2,  and  creating  the  new  vector 

x  =  [yi,  y2] T- 

Over  250  years  later,  after  many  periods  of  active  research  in  the  field  of  calculus 
of  variations,  the  Russian  mathematician  Lev  Semenovich  Pontryagin  made  a  giant  leap 
forward.  In  1956,  Pontryagin  and  his  group  established  the  optimal  control  theory  [22, 
23].  In  contrast  to  standard  calculus  of  variations  problems  of  the  form  (2.1)-(2.2),  or 
equivalent  form  (2.3)-(2.4),  it  was  shown  that  optimal  control  theory  was  well  suited  to 
handle  discontinuous  solutions,  u(t).  Additionally,  Pontryagin  established  that  problems 
of  optimal  control  involved  the  minimization  of  a  functional  over  a  set  of  function  pairs, 
1 1 — y  (. x ,  u)  e  M.Nx  x  M A“ ,  subject  to  the  dynamical  constraint 

x(t)  =  f(x(t),u(t)), 

where  /  :  WNx  x  M;V"  — >  WNx  and  u(t  )  is  a  control  function.  It  was  soon  realized  that  this 
new  theory  of  optimal  control  was  well  suited  to  solve  many  complex  problems  (that  the 
calculus  of  variations  could  not).  Over  the  last  half  century,  optimal  control  theory  has  been 
developed  into  an  extremely  powerful  tool  that  has  touched  many  areas  of  mathematics, 
science  and  engineering.  Consider  the  following  general  problem  of  optimal  control. 

2.1.1.  The  Optimal  Control  Problem 

Determine  the  state-control  function  pair,  1 1->  (x,  u)  e  M.Nx  x  MA”  ,  that  minimizes 
the  cost  functional 

J[x(-),u(-)]=  f  F(x(t),u(t))dt  +  E(x(tf)),  (2.5) 

Jto 

subject  to  the  dynamics, 


x(t)  =  f(x(t),u(t)), 


(2.6) 
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initial  conditions, 


x(t0)  =  x0, 


(2.7) 


at  specified  time,  t0,  and  endpoint  conditions, 

e(x(tf))  =  0,  (2.8) 

where  the  running  (or  Lagrange)  cost  F  :  RNx  x  RNu  — >  R,  the  endpoint  (or  Mayer)  cost, 
E  :  RNx  x  RNx  — >  R,  /  :  RNx  x  R^"  — »  RNx  and  e  :  RNx  x  RNx  — >  RNe,  are  Lipschitz 
continuous  with  respect  to  their  arguments.  A  set  of  necessary  conditions  must  be  met  in 
order  to  find  candidate  solutions  to  problem  (2.5) — (2.8).  Pontryagin’s  Minimum  Principle 
provides  the  necessary  framework. 

2.1.2.  Pontryagin’s  Minimum  Principle 

Pontryagin’s  Minimum  Principle  was  proved  by  Pontryagin  in  1956  [22,  23].  It 
provides  conditions  that  must  be  met  in  order  for  a  solution  to  be  considered  optimal.  As 
with  the  calculus  of  variations  both  necessary  and  sufficient  conditions  for  optimality  may 
be  established.  Although  sufficient  conditions  are  beyond  the  scope  of  this  dissertation 
(see  [24]),  first  order  necessary  conditions  will  be  outlined  with  help  from  the  calculus  of 
variations. 

2.I.2.I.  Calculus  of  Variations 

In  the  calculus  of  variations,  problems  of  the  form  (2.1)-(2.2)  are  solved  by  consid¬ 
ering  the  variation  of  J,  or  A  J,  given  by 

A  J[y*,y]  =  J[y\-J[y*} 

where  y*  is  the  minimizing  curve,  and  y  are  all  other  admissible  curves.  For  y*  to  be  a 
minimizing  curve  it  is  necessary  that  A  J[y*,y\  >  0.  Additionally,  if  all  the  first  order 
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terms  are  collected  in  the  expansion  of  A  J,  it  is  necessary  that  this  collection  (called  the 
first  variation  or  shown  symbolically  as  8J[y*\)  must  be  equal  to  zero  [21].  This  same 
approach  may  be  used  to  define  the  optimality  conditions  for  problem  (2.5)— (2.8). 


2.I.2.2.  Necessary  Conditions 

In  order  to  apply  a  variational  approach  to  problem  (2.5)-(2.8),  consider  the  aug¬ 
mented  functional, 

«(■),  A(-),  KO]  =  [  (-F(z(£),  «(*))  +  -  x(t)))  dt 

Jt0 

+E(x(tf ))  -  uTe(x(tf)), 

where  A (t)  E  WNx  and  v(t)  G  M;V'  are  Lagrange  multipliers,  and  X(t)  is  typically  given  the 
name  costate  or  adjoint  covector.  As  in  the  calculus  of  variations  approach,  considering  the 
first  variation,  5Ja[u*}  =  0,  a  set  of  necessary  conditions  can  be  obtained  [21,  25-27] 

(2-9) 
f)  M 

AW=-^.  (2.10) 

where  the  Hamiltonian,  H,  is  given  by 

H(x(t),u(t),X(t))  =  F(x(t),u(t))  +  A  Tf(x(t),u(t)).  (2.11) 


Additionally,  the  Hamiltonian  (2.1 1)  reaches  its  minimum  with  respect  to  u  at  u  =  u*.  This 
is  called  the  Hamiltonian  Minimization  Condition  and  can  be  expressed  as 

u*  =  arg  max  H(x(t),  u(t),  A (£)),  (2.12) 

«eu 
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where  U  defines  a  region  of  feasible  control.  Finally,  the  following  conditions  must  be 
satisfied  on  the  boundary 


Ktf) 

H(tf) 


d  E 
dxf'_ 
d  E 


e(x(tf ))  =0, 


where  the  endpoint  Lagrangian,  E,  is  given  by 

E(x(tf),  v)  =  E(x(tf ))  +  uTe(x(tf)). 


(2.13) 

(2.14) 

(2.15) 

(2.16) 


Equations  (2.9),  (2.10),  (2.12)  and  (2. 13)— (2. 15)  provide  the  first-order  necessary  condi¬ 
tions  for  optimality  and  create  the  framework  for  Pontryagin’s  Minimum  Principle. 

2.I.2.3.  Pontryagin’s  Minimum  Principle 

Lemma  2.1  (Pontryagin’s  Minimum  Principle).  [26]  Let,  (x*(t),u*(t)),  be  a  solution  to 
problem  (2.5)-(2.8).  Then  in  order  for  x*(t)  and  u*(t )  to  be  optimal,  it  is  necessary  that 
there  exists  a  costate,  \,  and  covector,  v,  that  satisfies  conditions  (2.9),  (2.10),  (2.12)  and 
(2.13)-(2.15). 

Remark  2.1.  For  problem  (2.5)-(2.8),  with  added  path  condition,  the  following  mixed 
state-control  inequality  path  constraint  is  included, 


h(x(t),u(t ))  <  0, 


(2.17) 


where  h  :  RNx  x  M;V"  — >  RNh  is  Lipschitz  continuous  with  respect  to  x  and  u. 
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With  the  addition  of  the  path  constraint,  candidate  solutions  to  problem  (2.5)-(2.8) 


and  (2.17)  can  be  found  by  solving  the  nonlinear  programing  (NLP)  problem 

u*  =  arg  max  H(x(t),u(t),  X(t),  p(t)), 

u£U(x) 

where  the  constraint  set,  U  C  K;Vu,  is  given  by 

U(x)  =  {u6  ll\h(x(t),  u(t))  <0,x  E  M.Nx,t  E  [to,tf]}. 

The  augmented  Hamiltonian  (or  Lagrangian  of  the  Hamiltonian),  II,  is  given  by 

H(x(t),u(t),  X (t),p(t))  =  H(x(t),u(t),  X (t))  +  pTh(x(t),u(t)), 

where  p(t )  E  RNh  are  Lagrange  multipliers.  The  modified  set  of  necessary  conditions 
are  [27,  28] 


.  .  ,  dH 
X{t)  =  9A  ’ 

: ,  ,  dH 
Mt> = -  to  - 


along  with  the  following  conditions  on  the  boundary 


Ktf) 

H(tf) 


dE 


dxf ’ 


dE 
dtf ’ 


e(x(tf))  =0. 


Additionally,  the  complementary  (slackness)  condition, 


(2.18) 

(2.19) 


(2.20) 

(2.21) 

(2.22) 
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(2.23) 


<0,  hi(x(t),u(t))  =  0, 

Ik  < 

|^=0,  hi(x(t),u(t))  <  0, 

must  be  satisfied. 

Equations  (2.18)-(2.23)  provide  the  first-order  necessary  conditions  for  optimality 
for  problem  (2.5)-(2.8)  and  (2.17). 

Although  Pontryagin  provided  a  framework  for  finding  candidate  optimal  solutions, 
many  problems  of  optimal  control  are  too  difficult  to  solve  analytically.  It  is  easy  to  see  the 
difficulty  in  solving  the  2 Nx  Hamiltonian  system  of  differential  equations  (2.9)-(2.10)  or 
(2.18  )— (2. 1 9).  For  this  reason,  numerically  methods  have  become  extremely  important  in 
solving  optimal  control  problems. 

2.1.3.  Numerical  Methods  for  Optimal  Control 

Many  numerical  techniques  have  been  investigated  for  solving  optimal  control  prob¬ 
lems  since  Pontryagin  proved  the  Minimum  Principle  in  1956.  These  optimal  control  meth¬ 
ods  take  two  main  forms,  indirect  and  direct.  Recent  surveys  of  these  techniques  are  pro¬ 
vided  by  Betts  [29,  30],  Trelat  [31]  and  Ross  [32]  and  a  historical  perspective  by  Stryk  et 
al.  [33].  Indirect  methods  (such  as  the  shooting  and  multiple  shooting  methods)  solve  Pon- 
tryagin’s  necessary  conditions  for  optimality.  Although  these  methods  have  been  shown 
to  solve  a  wide  range  of  problems  with  great  accuracy,  they  prove  to  be  difficult  to  im¬ 
plement,  due  to  the  knowledge  of  the  calculus  of  variations  required  and  the  difficulty  of 
providing  good  initial  guesses.  In  contrast,  the  direct  methods  (such  as  Euler,  Runge-Kutta 
and  collocation  methods)  discretize  the  cost  function,  problem  dynamics,  etc,  at  specified 
time  points.  Due  to  the  fact  that  direct  methods  require  no  knowledge  of  the  necessary  con¬ 
ditions  for  optimality,  and  the  accuracies  that  may  be  obtained,  they  have  recently  gained 
much  attention.  Of  the  direct  methods,  specifically  the  global  orthogonal  collocation  meth¬ 
ods  (a.k.a.  pseudospectral  methods)  have  proven  to  solve  difficult  problems  with  great  ac- 
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curacy  [4,  8,  34,  35]  after  becoming  an  actively  researched  topic  in  the  1990s  by  Elnagar  et 
al.  [1]  and  Fahroo  et  al.  [2], 

Pseudospectral  (PS)  methods  for  optimal  control  discretize  the  problem  at  speci¬ 
fied  nodes,  called  collocation  points.  Due  to  the  properties  of  the  orthogonal  family  of 
collocation  points  (such  as  those  found  via  the  Legendre  or  Chebyshev  polynomial  ba¬ 
sis)  approximations  converge  at  spectral  rates  [6].  The  most  widely  used  Legendre  PS 
method  [1,  5,  6,  9,  10]  is  based  on  the  LGL  points  [36].  However,  the  Legendre  PS  method 
may  be  based  upon  LGR  or  LG  nodes  as  well  [36,  37].  PS  methods  for  optimal  control 
have  been  formally  implemented  in  the  MATLAB-based  software  package  DIDO  [38]  and 
NASA’s  Fortran-based  package  OTIS  [39]. 

There  are  four  parts  to  the  numerical  solution  to  an  optimal  control  problem  us¬ 
ing  a  PS  method:  discretization  of  the  system  dynamics,  discretization  of  the  state-control 
constraints,  integration  of  the  cost  function  and  solving  the  nonlinear  program  (NLP).  The 
mathematical  background  associated  with  the  first  three  steps  will  be  discussed  in  the  fol¬ 
lowing  sections.  Spectral  methods  are  attractive  for  discretizing  the  problem’s  dynamics 
due  to  their  superior  accuracy.  Two  global  spectral  methods,  collocation  and  Galerkin, 
will  be  outlined  in  Section  2.3.1.  Additionally,  Galerkin  methods  may  be  formulated  as 
element-based  methods.  These  local  spectral  element  methods  will  be  outlined  in  Sec¬ 
tion  2.3.2.  A  fundamental  task  in  the  formulation  of  these  global  and  local  methods  is  the 
selection  of  good  discretization  points  and  the  use  of  interpolating  functions.  Both  will  be 
discussed  in  detail  in  Section  2.2.  Finally,  numerical  integration,  or  quadrature,  is  typically 
used  to  integrate  the  cost  function  and  will  also  be  outlined  in  Section  2.2. 

The  resulting  NLP  can  be  solved  by  using  a  commercial  sequential  quadratic  pro¬ 
gramming  (SQP)  software  packages  such  as  dense  NLP  solver  NPSOL  [40]  and  sparse  NLP 
solvers  SNOPT  [41,  42]  and  SPRNLP  [43].  A  feasible  solution  can  be  found  that  satisfies 
the  tolerances  specified  in  the  optimization  problem  by  adjusting  the  order  of  polynomial 
used  in  the  approximation. 
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2.2.  Interpolation  and  Numerical  Integration 

Interpolation  and  numerical  integration  serve  an  important  role  in  the  methods  out¬ 
lined  in  this  dissertation  and  will  be  discussed  in  this  section.  The  general  structure  of  this 
section  follows  that  provided  by  Giraldo  in  Chapters  4  and  5  of  [44]. 

2.2.1.  Interpolation 

Polynomial  interpolation  is  the  method  used  to  construct  an  iV-th  order  polynomial, 
or  interpolant,  xN (t).  that  approximates  a  function,  x(t).  This  is  typically  done  by  ensuring 
the  interpolant  passes  through  the  N  + 1  known  points,  {(U,  Xj)}^0,  so  that  x(U)  =  xN(U), 
for  i  —  0, 1, . . . ,  N.  This  may  be  accomplished  by  using  a  finite  sum  such  as 

N 

xN(t)  =  £>,(<)%.  (2-24) 

3=0 

where  {xj}f=0  are  the  expansion  coefficients  and  { }yC0  are  the  basis  functions.  Defining 
the  basis  functions,  {<!>,■  }^0,  as  modes  (such  as  Legendre  polynomials)  leads  to  modal 
type  of  interpolation.  However,  defining  the  basis  functions  in  a  nodal  fashion  such  that 

=  8ij,  for  i,  j  =  0, 1, . . . ,  N,  where 

-  ('  i=i' 

[o,  i  ji 

(such  as  Lagrange  polynomials)  leads  to  nodal  interpolation. 

2.2.1. 1.  Modal  Interpolation 

In  modal  interpolation,  the  basis  functions,  in  Equation  (2.24)  are  typi¬ 

cally  orthogonal  polynomials  and  the  eigenfunctions  of  the  singular  Strurm-Liouville  prob¬ 
lem.  Commonly  used  polynomials  are:  Legendre,  Chebyshev,  Fourier  and  Jacobi.  For  this 
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discussion  the  Legendre  polynomial,  L(t),  will  be  the  focus,  defined  by  [45] 

L’®={-w^{(i-t2y)'  <2-25) 

and  therefore  will  be  the  chosen  basis.  The  Legendre  polynomials  result  from  the  special 
case  of  the  singular  Strurm-Liouville  problem  [45], 

it  ((1  -  ^^tr) +iU + 1)Li(t) = °- 

Figure  1  shows  the  first  seven  Legendre  polynomials.  The  spectral  coefficients,  {aj}°^0. 


Figure  1:  Legendre  polynomials,  {Ln(£)}®=0. 
for  the  continuous  Legendre  expansion,  are  defined  as  [46] 

1  r1 

a,j  =  —  /  x(t)Lj(t)dt, 

7 j  4-1 


(2.26) 
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where  the  normalizing  constants,  {7j}°L0’  f°r  the  Legendre  polynomials  are  given  by 

2 

13  ~  2  j  +  1 

The  truncated  Legendre  modal  expansion  is  then  given  by 

N 

xN(t)  = 

3=0 

However,  due  to  the  interpolatory  nature  of  xN,  it  is  natural  to  seek  spectral  coefficients, 
{a,j}jL0,  defined  by 

N  N 

xN(U)  =  YMti)aj  =  YViM,  (2.29) 

3=0  j= 0 


for  the  known  points  (f where  V  is  the  generalized  Vandermonde  matrix  given  by  [47] 


^  Lo(to ) 

Li(fo) 

L]y(to)  ^ 

V  = 

Lo(fi) 

LN{ti) 

(2.30) 

y  Lo(tN) 

L\  (f  jv)  ■ 

■  ■  Ljsr(t]sr)  y 

From  Equation  (2.29),  the  modes,  {cij 

}jL0>  and  the  nodes,  {xNj}f=0, 

are  related  by  the 

generalized  Vandermonde  matrix  (2.30)  by  the  relationships  [47] 

N 

xNj  =  YViiai 

3=0 

and 

N 

(2-31) 

3=0 


(2.27) 


(2.28) 
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where  xNj  =  xN(tj )  for  j  —  0, 1, . . . ,  N. 

Remark  2.2.  Note  that  to  form  Equation  (2.31)  the  Vandermonde  matrix  (2.30)  must  be 
invertible  and  therefore  nonsingular  (and  more  practically  speaking,  well-conditioned).  We 
will  see  that  the  invertibility  of  the  Vandermonde  matrix  is  dependent  upon  the  interpolation 
quality  of  grid,  (see  Section  2. 2. 1.4  for  grid  selection)  [47].  Additionally,  we  take 

comfort  in  the  fact  that  the  set  of  Legendre  polynomials,  {Lj}°S0,  is  an  orthogonal  system 
that  has  shown  to  produce  well-conditioned  Vandermonde  matrices  for  carefully  selected 
nodes  (as  compared  with  the  ill-conditioned  Vandermonde  matrices  of  non-orthogonal  sys¬ 
tems  such  as  the  the  power  basis,  \  tn}  f=(i)  [48]. 

Note  that  Equation  (2.28)  is  a  sum  of  frequencies,  {Lj}jL  0,  and  amplitudes,  {aj}jL0, 
that  together  compose  the  (N 4-1)  modes  of  xN  (t  ) .  It  is  thus  fitting  to  describe  this  approach 
as  modal  interpolation. 


2.2.I.2.  Nodal  Interpolation 

In  nodal  interpolation,  the  basis  functions,  {E,}^,  in  Equation  (2.24)  are  the  La¬ 
grange  polynomials,  {(j)f}^=0,  of  order  N,  defined  on  grid  {fy}^L0,  obtained  from  the  gen¬ 
eral  definition  [45] 

«<)  =  rijTZTT  <232> 

i= 0  '  1  l' 


Additionally,  the  Lagrange  polynomials  may  be  defined  in  terms  of  the  Legendre  polyno¬ 
mial  by  [46] 


1  jt2-l)LN(t) 
N(N  +  1)  (t~t3)LN(t3f 


(2.33) 


Ligure  2  shows  the  order  N  =  6  Lagrange  polynomials,  defined  on  an  equi- 

spaced  grid,  t  G  [—1,1]. 
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Figure  2:  Lagrange  polynomials  of  order  N  =  6  defined  on  an  equi-spaced  grid. 

The  nodal  interpolation  of  the  function  x(t)  can  be  accomplished  by  the  iV-th  order 
expansion 

N 

xN(t)  =  Yjfl  (2.34) 

j=o 

where  =  xN(tj ),  for  j  =  0, 1, . . . ,  N,  since  the  Lagrange  polynomial  has  the  property. 

=  Si:i. 

Additionally,  the  Legendre  polynomial,  Lj(f),  of  order  i  can  be  written  as  linear 
combinations  of  Lagrange  polynomials.  {<^(f)}£L0,  of  order  N  defined  on  grid.  {fi}£L0, 
by  the  relationship  [47] 

N  N 

U(t)  =  (2.35) 

j=0  3=0 
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Likewise,  the  Lagrange  polynomial  may  be  written  as  linear  combinations  of  Legendre 
polynomials  by  the  relationship 


N  N 

=  E  (V%  =  E^TW-  (2-36) 

3= o  j= o 

From  Equations  (2.3 1)  and  (2.36)  we  can  relate  the  modal  and  nodal  forms  of  the  interplant, 

xN(t),  by 


N 

xN(t )  =%2<i>ir(t)xNt  = 

i= 0 

N  /  N 


N  /  N 

E  E ’EM*) 


i=0  \i=0 


N 


X 


Ni 


=ELiW  EL1*1"  =ELiW%- 

j=0  \  i=0  /  j=0 


Therefore,  Equation  (2.34)  is  truly  a  norir//  representation  of  Equation  (2.28). 


2.2.I.3.  Transformations  between  grids 

Consider  the  problem  of  transforming  between  two  different  grids,  {tj}jL0  and 
{Tj}jL0,  where  M  <  N.  Let  {of}^LQ  be  the  set  of  Lagrange  polynomials  of  order  M 
defined  on  grid  {Tj}f=  0.  Also,  let  the  function  xM it)  be  the  Lagrange  interpolating  poly¬ 
nomial 

M 

xM(t)  =  E 

where  xMj  =  xm(tj),  for  j  —  0, 1, . . . ,  N,  since  dfir,)  =  <),r  Then  the  approximation  of 
xM  at  the  dense  gridpoints,  {L.-}/~0,  can  be  calculated  with  the  linear  transformation 

M  M 

xM(U)  =  =  EL'"*'"’  *  =  0,1 

3=0  3=0 


16 


where  the  (N  +  1)  x  (M  +  1)  linear  mapping,  TNM,  is  given  by 


rjxNM  _ 


0oM(*o) 


0f(to) 

<(*i) 


K(to) 


(2.37) 


%  An) 


K(tr 


In  a  similar  fashion,  the  approximation  of  x  may  be  transformed  between  two  grids. 
Note  that  the  approximation  of  x  on  grid  {ry  }^0  may  be  given  by 


i(t)  --  xM(t)  = 


where  the  derivative  of  the  Lagrange  polynomial  is  defined  as 


k^=j  i^j 

i^k 


(2.38) 


Then  the  approximation  of  xM  at  the  dense  gridpoints,  {tj }JL0,  can  be  calculated  with  the 
with  the  linear  transformation 


xM(U)  =  (U)xMj  =  ^TA%mxMi,  i  =  0, 1, . . . ,  N, 


where  the  (N  +  1)  x  (M  +  1)  linear  mapping,  ANM,  is  given  by 


Anm  — 


0f(to) 


<{h) 


(2.39) 


?o  An) 


ML(ti 
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Using  the  transformation  matrices  (2.37)  and  (2.39)  to  relate  different  grids  will 
serve  as  an  important  tool  for  the  multi- scale  approximation  methods  outlined  in  Chapters  3 
and  5.  However,  the  accuracy  of  interpolation  is  extremely  important  and  will  be  discussed 
next. 

2.2.I.4.  Interpolation  Quality 

Approximation  quality  is  a  great  concern  when  using  interpolation.  The  goodness 
of  the  approximation  xN(t)  is  directly  related  to  the  grid  points,  {tj}fL0,  from  which  the 
Lagrange  polynomials,  {(j)^}f=0,  are  defined.  A  measure  of  interpolation  goodness  is  the 
Lebesque  constant,  AN,  given  by  [45] 

N 

AN  =  max  Y'' Uf  (t)  I .  (2.40) 

The  best  interpolating  polynomial  xN(t)  is  one  that  minimizes  the  Lebesgue  constant  (2.40), 
due  to  the  following  result  [45], 

|| x(t)  -  xN (t)\\Lao  <  (1  +  Ajv)||a;(t)  -p{t)\\LOO, 

where  p(t)  is  the  best  approximating  polynomial  of  x(t)  in  the  L°°-norm  (see  Appendix  A). 
From  [45,  49],  for  any  set  of  (N  +  1)  distinct  points,  C  e  [—1, 1],  for  i  =  0.  1, . , .V,  the 
Lesbegue  constant  (2.40)  has  the  lower  bound  [45], 

2 

-log(iV  +  1)  +  a  <  AjV; 

7T 

where  a  =  |  (7  +  log-)  ~  0.521  and  7  =  0.57721566...  is  the  Euler-Mascheroni  con¬ 
stant.  So,  at  best  the  selected  grid  is  associated  with  a  Lebesgue  constant  that  grows 
logarithmically  [45].  Common  Legendre  family  of  points  used  for  interpolation  are  the 
Legendre-Gauss  (LG),  Legendre-Gauss-Lobatto  (LGL)  and  Legendre-Gauss-Radau  (LGR) 
points. 
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Legend  re-Gauss  Points.  The  LG  points,  {t,}^0,  are  defined  by  —  1  <  to  <  •  •  •  <  fjv  <  1, 
and  are  the  roots  of 


m  =  LNn{t),  (2.41) 

where  Ljv+i(£)  is  the  (N  +  l)-th  order  Legendre  polynomial.  Note  that  the  LG  points  do 
not  include  the  endpoints,  t  =  ±1.  Figure  3  shows  the  LG  points  for  various  orders  of  N. 


-in  II  1  ;  1  1 1 — rl — 1 - H 

j  1 

H - 1 - 

. _i_ . _| 

- 1 - h- 

H — 1 — h 

rH - 1  1  ;  1  I  Mil- 

j 

j _ i _ L _ J _ jj 

_ _4 

U _ i 

j 

i  ii _ i  i 

1^  *  -CT5  ’ 

T  65  '  i 

Figure  3:  LG  points  for  N  —  10,  20  and  .30. 


Legendre-Gauss-Lobatto  Points.  The  LGL  points,  {fi}^0,  are  defined  by  £o  =  —  1  < 
ti  <  •  •  •  <  tN  =  1,  and  are  the  roots  of 

f(t)  =  (l-f2)Ljv(t),  (2.42) 

where  Ljv(f)  is  the  derivative  of  the  iV-th  order  Legendre  polynomial.  Note  that  the  LGL 
points  include  the  endpoints,  t  =  ±1.  Figure  4  shows  the  LG  points  for  various  orders  of 
N. 
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Figure  4:  LGL  points  for  N  =  10, 20  and  30. 


Additionally,  Figure  5  shows  the  order  N  =  6  Lagrange  polynomials,  <f>N,  defined  on  a 
LGL  grid. 


Figure  5:  Lagrange  polynomials  of  order  N  =  6  defined  on  a  LGL  grid. 


Legendre-Gauss-Radau  Points.  The  LGR  points.  {f,}£Lo’  are  defined  by  to  =  —  1  <  U  < 
•  •  •  <  fjv  <  1,  and  are  the  roots  of 


£(t)  —  LN+i(t )  4-  LN(t). 


(2.43) 
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Note  that  the  LGR  points  only  include  the  endpoint,  t  =  —1.  Figure  6  shows  the  LGR 
points  for  various  orders  of  N. 


Figure  6:  LGR  points  for  N  —  10,  20  and  30. 

Additionally,  flipped-LGR  (F-LGR)  points  are  the  negative  of  the  LGR  points  and  are  there¬ 
fore  defined  by  —1  <  t0  <  -  ■  ■  <  tN  =  1.  Note  that  the  F-LGR  points  only  include  the 
endpoint,  t  —  1. 

Although  all  three  sets  of  Legendre  points  (LG,  LGL  and  LGR)  have  Lebesgue 
constants  (2.40)  that  grow  logarithmically  or  sublinearly  with  N,  the  LGL  grid  is  asymp¬ 
totically  associated  with  the  near  optimal  Lebesgue  constant  [45], 


At  <  -log (N  +  1)  +  0.685... 

7 T 


As  alluded  to  in  Remark  2.2,  the  quality  of  interpolation  can  also  be  observed  by 
analyzing  the  conditioning  of  the  Vandermonde  matrix  (2.30).  Due  to  the  relationship 
between  the  Legendre  polynomials,  {Lj}^=  0,  and  Lagrange  polynomials,  {< pf}^=0 ,  shown 
in  (2.35),  Cramer’s  rule  [50]  provides  the  following  relationship 


,jv  _  Pet  [L((0), . . . ,  L(fj_i),  L((),  L . . . ,  L (tN)] 
9 3  Pet  [VT] 


(2.44) 


where  L(f)  =  [L0(t),  Li(t), . . . ,  LN(t)]T .  As  pointed  out  by  Hesthaven  et  al.  [47],  if  the 
goal  is  to  minimize  the  Lebesque  constant  (2.40),  we  should  strive  to  maximize  the  de¬ 
nominator  of  Equation  (2.44),  Det  [VT\ .  This  leads  to  the  LGL  grid  set  [51].  Additionally, 
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the  Chebyshev-Gauss  family  of  points  proves  to  have  excellent  interpolation  quality,  par¬ 
ticularly  the  Chebyshev-Gauss-Lobatto  points  when  measured  by  the  Lebesgue  constant 
growth  [45.  52],  However,  the  focus  in  this  dissertation  will  be  on  the  Legendre  basis,  and 
thus  the  LG.  LGL  and  LGR  points. 

Unfortunately,  equi-spaced  points  prove  to  be  a  veiy  poor  grid  selection  for  inter¬ 
polation.  The  Lebesgue  constant  for  the  equi-spaced  points  grows  asymptotically  like  [52, 
53], 

oJV+l 

*  ES  ^  - _ 

eJV(log  N  4-  7)  ’ 

very  far  from  optimal. 

As  an  example  of  interpolation  quality  consider  the  function 

/(f)  =  cos(/T7Tf),  f  £  [-1, 1],  (2.45) 

with  n  =  3,  shown  in  Figure  7. 


Figure  7:  Plot  of  /(f)  =  cos(37rf). 


When  low  order  approximations  are  used  to  interpolate  /(f),  inaccuracies  are  appar¬ 
ent.  Figure  8  shows  the  inaccuracies  in  the  10-th  order  Lagrange  interpolating  polynomial 
approximation  of  /(f)  with  LGL  points. 


Figure  8:  Interpolation  of  /(f)  =  cos(37rf)  with  10-th  order  LGL  points. 
However,  as  the  interpolation  order,  N,  is  increased,  the  maximum  error, 

llerror|loo  =  ||/(*0  -  /"ML*  *  =  0, 1, . . . ,  JV, 

decreases  exponentially  with  N,  where  ||  £  represents  the  maximum  element  of  vector, 
£  £  Rn.  Figure  9  shows  the  visual  accuracy  of  the  30-th  order  Lagrange  interpolating 
polynomial  of  /(f)  with  LGL  points. 
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Figure  9:  Interpolation  of  f(t )  =  cos(37t£)  with  30-th  order  LGL  points. 

Additionally,  Figure  10  compares  interpolation  of  f(t )  with  equi-spaced,  LG,  LGL 
and  LGR  points,  for  various  orders  of  N.  Notice  that  for  LG,  LGL  and  LGR  points,  the 
maximum  interpolation  error  drops  to  O(10-15)  by  N  =  35.  However,  in  general,  the 
equi-spaced  points  prove  to  have  very  poor  interpolation  quality. 
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Figure  10:  Comparison  of  interpolation  errors  for  various  orders  of  N  and  equi-spaced, 
LG,  LGL  and  LGR  points. 


Due  to  the  accuracy  of  LGL  interpolation,  and  inclusion  of  the  endpoints,  t  =  ±1, 
LGL  points  are  readily  used  for  numerical  computation.  However,  also  related  to  point 
selection  is  the  accuracy  of  numerical  integration.  This  is  an  important  factor  for  the  direct 
methods  for  optimal  control  (due  to  the  cost  function  that  is  normally  approximated  by 
numerical  integration)  and  will  be  discussed  next. 


2.2.2.  Numerical  Integration 

Numerical  integration,  or  quadrature,  is  a  way  of  approximating  an  integral  with  a 

sum 


,i  at 

/  x(t)dt  »  ^2x(tk)wk, 
J -i  k= o 
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where  {«'/,•} /)10  are  the  quadrature  weights  and  {tk}^=0  are  the  associated  points.  Ideally, 
the  numerical  integration  is  exact,  but  it  is  reasonable  to  expect  an  error,  eN,  such  that 

„1  N 

eA  =  x(t)dt  —  y \(tk)wk. 

J-1  k= 0 

For  the  special  case  of  the  numerical  integration  of  general  function  x  G  CN+\  that  is 
approximated  by  the  Lagrange  interpolating  polynomial 

N 

x(t)  ~  XN(t)  = 

j= 0 

the  error  may  be  given  by  [54] 


eN  = 


(N  +  1)\  J _ 


f-l  N 

/  W-ti 

J~1  j= 0 


d,(N+Vx(£(t)) 
1  dt(N+V 


dt, 


for  arbitrary  function  £(£)  G  [—1,1]. 

However,  for  certain  classes  of  polynomial  functions,  the  quadrature  error  is  zero. 
Consider  the  function  x(t)  G  Pn,  represented  as  the  finite  sum 


N 

x(t)  =  ^2<j)"(t)xNj. 

j= 0 

Performing  numerical  integration  on  x  results  in 


( i>f(t)dt 


N  N  N 

=  yy^y ^(/)f(tk)wk  =  Y^’wj. 

j= 0  fc=0  j= 0 
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Clearly,  since 


(2.46) 


quadrature  is  exact  \/x(t)  G  Pn ■  Additionally,  the  quadrature  weights,  {wj}jL0,  can  be 
found  with  the  relationship 


Wj  = 


(2.47) 


2.2.2. 1.  Gaussian  Quadrature 

Consider  now  the  case  that  x(t )  G  P2tv+i  written  in  the  form 


x{t)  =  LN+1(t)f(t)  +  g(t), 


where  /,  g  G  Pn,  -C/v+i  is  the  Legendre  polynomial  of  order  (N  +  1)  and  PN  denotes 
the  space  of  all  polynomials  of  degree  <  N.  Also  consider  the  (N  +  1)  points,  {C}/)=0, 
that  are  the  roots  of  LN+1(t)  (known  as  LG  points,  discussed  in  Section  2.2.1),  and  the 
associated  quadrature  weights,  {wk}k=0  (known  as  LG  quadrature  weights,  found  via  the 
general  definition  (2.47)  or  the  more  specific  definition  (2.48)).  Then  x(tk)  =  g(tk),  for  all 
k  —  0, 1, . . . ,  N,  and 

J  x(t)dt  —  J  (LN+1(t)f(t)  +g(t))dt 

N  N 

=  LN+i(tk)f{tk)wk  +  ^2g(t  k)wk 

k= 0  k= 0 

N  N 

=  y 'g(tk)wk  =  y \(tk)wk. 

k= 0  k= 0 

This  is  known  as  LG  quadrature  (or  simply  Gauss  quadrature),  which  is  exact  \/x(t)  G 
P‘2N+\-  In  the  case  that  the  function  x(t)  is  a  polynomial,  such  that  x(t)  G  P2N+5,  numerical 
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integration  is  exact  for  LGL  and  LGR  quadrature,  where  5  =  —  1  and  0,  respectively.  The 
proof  of  LGL  and  LGR  quadrature  exactness  for  polynomials  is  similar  to  that  given  above 
for  LG  quadrature.  The  list  of  LG,  LGL  and  LGR  weights  are  presented  below  (provided 
by  [47])  and  point  locations  are  given  by  Equations  (2.41),  (2.42)  and  (2.43),  respectively. 
Legendre-Gauss  Quadrature.  The  LG  quadrature  weights,  {wk}^=0,  are  given  by 


[1  -(4)2][W(4)]2' 

Legendre-Gauss-Lobatto  Quadrature.  The  LGL  quadrature  weights,  {wk}%=0,  are  given 
by 


u>k 


2  1 
N(N  +  1)  [Ljv(ffc)]2 


(2.49) 


Legendre-Gauss-Radau  Quadrature.  The  LGR  quadrature  weights,  {wk}^=0,  are  given 
by 


1  1-4 

(N  +  l)2  [LN(tk)}2 


(2.50) 


Additionally,  the  F-LGR  quadrature  weights,  {wfc}^=0,  are  the  reordered  LGR  quadrature 
weights,  {wk}£=0  =  {wN_k}%=0. 


2.2.2.2.  Gaussian  Quadrature  Accuracy 

The  Legendre-Gaussian  family  of  quadrature  is  widely  used  due  to  its  integrating 
accuracy  and  has  the  following  error  estimates  for  LG,  LGL  and  LGR  quadrature,  respec¬ 
tively  [55] 
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fN 

tLG 


22iV+3[(iV  +  i)!]4  d(2N+ 2)x^ 

'(2N  +  3)[(2N  +  2)!]3  dt(2N+ 2) 


,  [-1,1], 


fN 

lLGL 


-(IV  +  l)jV322JV+1[(lV  -  l)!]4  S2Nh(0 


(2N  +  1)[(27V)!]3 


dt(2N) 


,  [-1,1], 


- LGR 


2 2N+1(N  +  1) (iV!)4  S2N+Vx(0 
'{2N  +  l)!]3  M  2N+V 


,  e  e  [-1,1]- 


Consider  again  the  function  (2.45),  with  /i  =  1/2.  As  an  example  of  quadrature 
accuracy,  consider  the  numerical  approximation, 


cos 


'-i 


7 rt 


N 


dt  ~  yr(tk)wk, 


(2.51) 


k= 0 


where 


N 

nt)  ~  sN(t )  = 


3=0 


and  fNj  =  cos(^),  for  j  —  0, 1, . . . ,  N.  Due  to  the  property  of  the  Lagrange  polynomial, 
oj (ti)  =  Sij,  the  quadrature  expression  in  (2.51)  takes  the  form  (2.46), 


N 


N 


yntk)wk  =  yfNki 


wk. 


k= 0 


k= 0 


As  with  interpolation  error,  quadrature  error  decreases  exponentially  with  N.  Fig¬ 
ure  1 1  shows  numerical  integration  error, 


error  = 


»i 


cos ( ) dt  -  »s(^K 

1  k= 0 
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for  LG,  LGL  and  LGR  quadrature,  all  of  which  demonstrate  similar  exponential  conver¬ 
gence  as  N  increases. 


Figure  1 1 :  Comparison  of  quadrature  errors  for  various  orders  of  N  and  LG,  LGL  and  LGR 
points. 


Due  to  the  accuracy  of  quadrature,  quality  of  interpolation  and  inclusion  of  the 
endpoints,  t  =  ±1,  LGL  points  are  an  important  part  of  many  numerical  computation  ap¬ 
plications,  to  include  direct  methods  for  optimal  control.  For  instance,  in  the  Legendre  PS 
method  the  problem  state-control  constraints  are  discretized  using  LGL  points.  Addition¬ 
ally,  the  states  are  approximated  with  globally  interpolating  Lagrange  polynomials  defined 
on  an  LGL  grid  and  the  cost  function  is  approximated  using  LGL  quadrature  rule.  LGL 
interpolation  and  quadrature  also  serve  an  important  role  in  the  construction  of  the  Galerkin 
optimal  control  formulations  discussed  in  this  dissertation. 

A  crucial  step  in  solving  optimal  control  problems  with  direct  methods — and  yet  to 
be  discussed — is  the  discretization  of  the  system  dynamics.  Although  a  number  of  tech¬ 
niques  have  been  investigated  to  do  this,  spectral  methods  have  proven  to  be  effective  and 
efficient.  This  is  a  highlight  in  the  PS  direct  methods  for  optimal  control,  where  collocation 
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methods  are  used  to  discretize  system  dynamics.  Spectral  methods  also  serve  as  the  heart 
of  Galerkin  optimal  control,  where  Galerkin  methods  are  employed.  Both  collocation  and 
Galerkin  methods  will  be  discussed  in  the  next  section. 

2.3.  Numerical  Solutions  to  Differential  Equations 

Spectral  methods,  which  have  gained  much  popularity  due  to  their  spectral  accu¬ 
racy  and  versatility  [56,  57],  can  be  formulated  for  both  local  (element-based)  and  global 
approximations.  Consider  the  task  of  discretizing  the  dynamics  of  problem  (2.5)— (2.8), 

x(t)  =  f(x(t),u(t)),  t  G  [t0,t/l,  (2.52) 

where  t  i— >•  (x.  u )  G  x  WNu,  with  initial  conditions, 


x(t0)  =  x0, 


(2.53) 


and  endpoint  conditions, 


e(x(tf ))  =  0,  (2.54) 

where  /  :  RNx  x  RNu  — >  RNx  and  e  :  RNx  x  RNx  — >•  RNe,  are  Lipschitz  continuous  with 
respect  to  their  argument.  This  can  be  accomplished  using  a  number  of  spectral  method  for¬ 
mulations.  However,  two  methods  will  be  discussed  here,  collocation  and  Galerkin.  Addi¬ 
tionally,  of  the  spectral  element  methods,  continuous  and  discontinuous  Galerkin  element- 
based  formulations  will  be  outlined. 
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2.3.1.  Spectral  Methods 

The  starting  point  for  the  spectral  approximation  of  Equation  (2.52)  is  to  approxi¬ 
mate  the  solutions  x  and  u  by  the  finite  sums 

N 

3=0 

N 

U(o  « uN(t)  = 

3=0 

where  x3  and  u3  are  expansion  coefficients  and  (1>3  and  <3>;  are  basis  functions.  In  terms  of 
the  approximation  xN  and  uN,  Equation  (2.52)  becomes 

“"«))  =  <2-55> 

where  At  =  tf  —  t0  and  eN  is  the  error  (or  residual)  in  the  approximation  which,  gener¬ 
ally,  is  not  zero.  The  relationship  between  the  physical  time  domain,  t  G  [to,  t /],  and  the 
computational  space,  £  G  [—1, 1],  is  given  by 

2  2 
s  A  tK  '  At 

and  conversely, 

t  —  ^  (£  +  1)  +  t0  and  dt  =  ^ -d £. 

2.3.1. 1.  Collocation 

In  the  collocation  method  the  basis  functions,  are  the  Lagrange  polyno¬ 
mials  (2.32),  {0^}^o,  of  order  N,  defined  on  the  grid  of  collocation  points,  G 

[— 1, 1];  while  =  {ip^}^=0,  where  {^}f=0  is  any  continuous  function  (not  nec¬ 

essarily  a  polynomial)  with  the  property  'ip3(Ci)  =  Aj-  The  expansion  coefficients  are, 

Xj  =  xNj  and  u3  =  uNj,  therefore  xN(p3)  =  xNj  and  uN(^j)  =  uNj,  for  j  =  0, 1, . . . ,  N. 
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The  approximations  of  x  and  u  in  the  computational  space,  £,  are  given  by 

N 

*"«)  =  £<({)*«, 

3=0 

N 

«"({)  =  y>f  «)«« 

J=0 

The  approximation  of  i;  in  the  computational  space,  £,  is  then 

N 

j=0 


where  the  derivative  of  the  Lagrange  polynomial,  {0jv}jLo,  is  given  by  Equation  (2.38). 
Additionally,  in  the  collocation  method,  the  error  term,  eN,  is  ideally  forced  to  zero  at  each 
collocation  point,  therefore,  Equation  (2.55)  becomes 


f =  °. 

3=0 


i  —  0, 1, . . . ,  N. 


Collocation  methods  assume  the  title  pseudo  spectral  methods,  due  to  the  nodal 
nature  of  the  formulation — in  lieu  of  spectral  referring  to  a  transformation  from  physical 
to  spectral  space.  Common  Legendre  family  of  collocation  nodes  used  are  the  LGL,  LG 
and  LGR  points.  When  LGL  nodes,  {&}£L0,  defined  by  —1  =  £0,  £i;  •  •  •  ,€n-i,€n  =  1, 
are  used  for  the  discretization,  the  Legendre  PS  differentiation  matrix,  A,  is  given  by  Atj  = 
for  i,j  =  0, 1, . . . ,  N,  resulting  in  the  system  of  equations, 


N  .At 

J2AijXNj  -  ~Yf(xN\  uNl)  =0,  i  =  0, 1, . . . ,  N.  (2.56) 

3=0 
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In  addition  to  Equation  (2.38),  the  Legendre  PS  differentiation  matrix,  A,  may  be  defined 
in  terms  of  the  Legendre  polynomials  by  [46] 


Aij 


=  < 


£jv(£i)  1 

ln(€j)  €i-£j 

N(N+ 1) 

4 

N(N+ 1) 

4 

0 


i 

i  =  j  =  0, 
i=j  =  N, 


(2.57) 


Remark  2.3.  The  derivative  of  xN  (£)  at  each  LGL  point  {£i}^L0  ls  exactb'  equal  to 


N 


3=0 


for  any  polynomial  with  degree  less  than  or  equal  to  N  [46].  However,  a  feasible  solution 
to  the  equality  dynamical  constraint  may  not  exist.  In  order  to  guarantee  feasibility  of  the 
discretized  problem,  Gong  et  al.  [5],  suggest  a  relaxation  of  the  equality  constraint. 


Therefore,  Equation  (2.52)  may  be  discretized  with  the  following  inequality  constraint, 

<SN ,  i  =  0,1,...,  iV, 

where  5N  is  the  feasibility  tolerance  that  is  dependent  on  N  and  the  smoothness  of  x  and 
u  (see  Section  3.2);  and  ||  ('  represents  the  maximum  element  of  vector,  ('  e  Mn.  The 
initial  conditions  and  endpoint  conditions  may  be  approximated  similarly  by 


\x 


NO 


—  xn  <  5n  and  e(x 

^  no  —  v 


-NN 


<  8 


N 


Collocation  methods  have  become  popular  for  the  discretization  of  system  dynam¬ 
ics  in  direct  methods  for  optimal  control,  specifically  psuedospectral  methods.  Lor  the 
Legendre  PS  method,  the  LGL  points  become  the  discretization  of  choice. 
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2.3.I.2.  Galerkin  Numerical  Methods 


Galerkin  methods  can  be  subdivided  into  two  main  categories,  Bubnov-Galerkin 
and  Petrov-Galerkin.  The  weighted  residual  method  forms  the  basis  for  the  development 
of  these  approximation  techniques  and  will  help  to  distinguish  them  [57]. 

For  the  weighted  residual  method,  the  weak  integral  form  of  the  Equation  (2.55)  is 
solved  by  multiplying  by  a  test  function,  T,:,  integrating  over  the  domain,  and  ideally  we 
force  the  residual  term  to  zero, 

*i(0  (V(0  -  ^/(rt'v«),«'vK)))  rf?  =  *,{S)eK(t)d(  =  0,  (2.58) 

for  i  —  0, 1, . . . ,  N. 

Remark  2.4.  Setting  the  residual  terms  to  zero  in  Equation  (2.58),  (f\1  (f)df  —  0, 

for  each  i  —  0. 1, ,  N),  is  akin  to  forcing  the  orthogonality  of  the  space  spanned  by  T, 
and  eN  in  L2[— 1,1]. 

The  approximation  xN  and  uN  can  be  found  satisfying  Equation  (2.58).  Common 
test  functions  include  orthogonal  polynomials  (such  as  the  Legendre  polynomials,  L)  and 
the  trigonometric  functions.  In  the  Bubnov-Galerkin  method,  the  test  functions  are  the 
same  as  the  basis  functions,  unlike  the  Petrov-Galerkin  method,  where  the  test  and  basis 
functions  are  different.  The  general  structure  of  these  global  Galerkin  methods — as  well 
as  the  mathematical  notation  used  in  this  dissertation — is  provided  by  Giraldo  [44]  and 
discussed  in  the  following  sections. 

Bubnov-Galerkin  In  the  Bubnov-Galerkin  method  (or  often  called  simply  the  Galerkin 
method)  the  test  and  basis  functions  are  the  same.  These  functions  can  be  modal  or  nodal 
in  nature,  however,  in  this  section  the  focus  will  be  on  nodal  Galerkin  methods.  For  a  nodal 
Galerkin  approach,  it  is  common  to  use  a  Legendre  based  grid  such  as  the  LGL,  LG  or  LGR 
nodes,  and  define  the  test  and  basis  functions  as  Lagrange  polynomials  (2.32), 
of  order  N,  on  the  selected  grid.  A  popular  selection  for  interpolation  points  are  the  LGL 
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nodes  due  to  the  accuracy  of  LGL  quadrature  and  the  inclusion  of  endpoints,  t  —  ±1.  For 
this  discussion,  the  LGL  nodes  will  be  the  focus. 

The  approximations  of  x  and  u  in  the  computational  space,  £  G  [—1, 1],  are  given 
by 

N 

*”(()  = 

3=0 

N 

uN(Q  = 

3=0 

where  {,0jv}^=o  is  any  continuous  function  (not  necessarily  a  polynomial)  with  the  property 
ipj(€i)  =  Sij.  The  expansion  coefficients  are,  Xj  =  xNj  and  u:)  =  uNj,  therefore  xN(£j)  = 
xNj  and  uN(Zj)  =  uNj,  for  j  =  0, 1 ,N.  Equation  (2.58)  becomes  [44] 

J  <l>?(t)<j>jr(Z)d£xN3  J  $ (£)f(xN(Z),uN (£))<%  =  0,  (2.59) 

for  i  —  0, 1, . . . ,  N,  or  using  matrix-vector  notation, 

N 

^ Dij% N3  -Ci  =  0,  i  —  0, 1, ... ,  N. 

3=0 

The  Galerkin  differentiation  matrix,  D,  and  RHS  vector,  c  are  defined  as 

A j  =  f 

for  i,j  =  0,1,...,  TV. 

Using  LGL  quadrature,  D  can  be  calculated  with  the  relationship 

Q 

Dij  =  ^2<ff(£k)<j>?(£k)v>k  =  (6K  =  AijWi,  i,j  —  0,1, ...  ,N, 

k= 0 
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where  {v-',}jf()  are  the  LGL  weights  given  by  Equation  (2.49)  and  A  is  the  Legendre  PS 
differentiation  matrix  (2.57).  Since  LGL  quadrature  rule  is  exact  for  polynomial  integrands 
of  degree  less  than  or  equal  to  2N  —  1,  the  numerical  integration  is  done  exactly  when 
Q  =  N  LGL  integration  points  are  used. 

Using  LGL  quadrature  rule,  c  can  be  approximated  by  the  relationship 

A/  Q 

Ci  ~  y  ^2^^k)f(xN(^k),uN(^k))wk,  i  =  0,1,...,  N. 

k= 0 

When  Q  =  N  LGL  quadrature  points  are  used,  the  RHS  vector  approximation,  cN,  can  be 
expressed  in  the  simplified  form 

cNl  =  ^f(xN\um)wi,  i  =  0,1,...,  TV. 

Remark  2.5.  Recall  that  for  LGL  quadrature  rule,  integration  is  exact  for  polynomial 
integrands  of  degree  less  than  or  equal  to  2N  —  1.  If  0  =  (N  +  1)  integration  points  are 
used,  the  RHS  vector  will  integrate  exactly  when  f(x(f  ),  u(t))  is  linear  in  x  and  u.  In  the 
case  of  a  nonlinear  function  f,  the  accuracy  of  integration  (and  therefore  the  accuracy  of 
the  overall  approximation )  can  be  improved  by  increasing  the  number  of  quadrature  points 

Q. 

When  Q  =  N  LGL  quadrature  points  are  used  to  calculate  the  Galerkin  differentiation 
matrix  and  approximate  the  RHS  vector,  the  system  may  be  simplified  as 

N 

J2DijXNj  -cm  =  0,  i  =  0, 1, . . . ,  N.  (2.60) 

j=o 

Remark  2.6.  Inform  (2.60),  the  resulting  Galerkin  equations  that  must  be  satisfied  are 

(j2A^Nk  ~  y  f^mr  Um) )  Wi  =  0.  (2-61) 
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for  i  =  0,1, ....  N.  Note  that  the  relationship  in  parentheses, 


5™)  =0, 

3=0 

for  i  =  0,1 , ,N,  are  the  same  equations  that  would  be  satisfied  when  using  the  col¬ 
location  method  (see  Equation  [2.56]).  For  this  reason,  Bubnov  Gcderkin  with  numerical 
integration  is  sometimes  called  the  “collocation  method  in  the  weak  form.”  [57] 

In  the  words  of  John  Boyd,  “collocation — with  the  right  set  of  points — must  inherit 
the  aura  of  invincibility  of  the  Gcderkin  method.”  [58] 

Remark  2.7.  An  inequality  version  of  (2.61)  has  been  known  and  used  in  pseudospectral 
optimal  control  methods.  Details  on  its  relationship  with  Gcderkin  optimal  control  are 
addressed  in  Chapter  4  in  Remark  4.2. 

Due  to  the  results  of  Gong  et  al.  [5],  we  know  a  feasible  solution  to  the  equality 
dynamical  constraint  may  not  exist.  In  order  to  guarantee  feasibility  of  the  discretized 
problem,  the  following  inequality  constraint  is  suggested, 

<5n ,  i  =  0,1,...,  N,  (2.62) 

where  5N  is  the  feasibility  tolerance  that  is  dependent  on  N  and  the  smoothness  of  x  and 
u  (see  Chapter  4).  The  initial  conditions  and  endpoint  conditions  may  be  approximated 
similarly  by 


N 

YJDljxNj  -  cm 

3=0 


||^°-Xo|L<^  and  ||e(xJViV) <  5N . 

Remark  2.8.  The  inequality  formulation  (2.62)  introduces  some  fundamental  differences 
in  numerical  ancdysis.  In  the  Gcderkin  approach,  the  error  is  measured  by  the  L2-norm. 
As  a  result,  5N  has  a  feasibility  with  a  slightly  relaxed  bound,  by  a  factor  of  ffvfi  (see 


38 


Section  4.2  for  the  general  Galerkin  optimal  control  computational  strategies,  particularly 
Equations  (4.10)  and  (4.13)). 


Remark  2.9.  With  the  Galerkin  formulation  outlined  here,  the  initial  conditions  may  be 
enforced  in  a  weak  sense.  In  other  words,  ICs  may  be  imposed  only  up  to  the  order  of  ac¬ 
curacy  of  the  numerical  approximation  itself.  Consider  again  Equation  (2.59).  Integration 
by  parts  on  the  first  term  results  in  the  Galerkin  weak  form, 


In  terms  of  the  approximating  polynomials  (and  introducing  the  true  initial  condition, 
xN(— 1)  — *  x{— 1))  we  have 

_N  /•!  \  +  C 

J  *Nj  -  ‘/’f  MM-1)  +  </>iJV(1)a;Ar(1)  ~y  J  uN =  °> 


for  i  =  0,1 , ...  ,N.  Integration  by  parts,  yet  again,  results  in  the  Galerkin  strong  form, 


N  /  N 

J2Di^Nj + ^<v(-1)  ( (_1)^  -  x(-1) 

3=0  \j=0 

fN 

~  tfi1)  ~  )  -  °i  =  O' 

\j= 0 


(2.63) 


Equation  (2.63)  may  be  formulated  for  weak  enforcement  of  ICs  by  letting  x(— 1)  =  Xq  and 
xN  ( 1 )  =  xNN.  Additionally,  when  Q  =  N  LGL  quadrature  points  are  used  to  calculate 
the  Galerkin  differentiation  matrix  and  approximate  the  RHS  vector,  the  system  may  be 
simplified  as 


N 


J2DijXNj  +  -  CNi 

1=0 


0, 


(2.64) 
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for  each  i  —  0, 1, . . . ,  N,  where 


I  ar^0  —  x0,  i  =  0, 

=  ( 

[0,  if(). 

The  IC  term  n  now  provides  a  natural  way  to  introduce  initial  conditions  into  the  discretiza¬ 
tion  of  the  dynamics.  Again,  in  order  to  guarantee  feasibility  of  the  discretized  problem, 
the  following  inequality  constraint  is  suggested, 

<SN,  i  =  0,1,...,  iV,  (2.65) 

OO 

where  8N  is  the  feasibility  tolerance  that  is  dependent  on  N  and  the  smoothness  of  x  and  u 
(see  Section  5.1).  Fincdly,  the  endpoint  conditions  may  be  approximated  similarly  by 

Remark  2.10.  In  [11],  the  equation  resulting  from  dividing  Equation  (2.64)  by  Wi  is  in¬ 
troduced  for  primal-only  closure  conditions.  However,  for  feasibility  the  inequality  version 
of  this  expression,  Equation  (2.65),  must  be  used  for  computational  purposes.  It  should 
be  noted  that  if  the  equation  in  [11]  is  multiplied  by  Wj  first,  then  relaxed  as  an  inequality 
bounded  by  SN,  the  resulting  inequality  would  be  in  agreement  with  the  feasibility  of  the 
Galerkin  weak  boundary  formulation  discussed  in  Section  5.1.2  (see  Equation  (5.6)). 

Petrov- Galerkin  In  the  Petrov-Galerkin  method  the  test  and  basis  functions  are  different. 
As  with  the  Bubnov-Galerkin  method,  these  functions  can  be  modal  or  nodal  in  nature.  In 
this  section  the  focus  will  be  on  selecting  a  modal  test  function  and  a  nodal  basis.  This 
will  create  the  framework  that  will  be  used  in  Chapter  4.  For  this  formulation,  the  selected 
test  functions  will  be  the  Legendre  polynomials,  {Lj}^=0,  and  the  Lagrange  polynomials 


N 

y ^PijxNj  +  Ki~  cNi 

7=0 
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(2.32),  {4>^}f=0,  of  order  N,  will  be  the  basis.  Again,  for  this  discussion,  the  LGL  node 
structure  will  be  used  for  the  problem  discretization. 

The  approximation  of  x  and  u  in  the  computational  space,  £  G  [—1, 1],  are  given  by 

N 

rN(o  = 

3=0 

N 

«K(o  =  2>r  mNi, 

3=0 

where  is  any  continuous  function  (not  necessarily  a  polynomial)  with  the  property 

ipj(£i)  =  5ij.  The  expansion  coefficients  are,  Xj  =  xNj  and  u3  =  uNj,  therefore  xN(£j)  = 
xNj  and  uN(£j )  =  uNj,  for  j  =  0, 1 ,N.  Equation  (2.58)  becomes  [44] 

f  xNi  -  y  f  h(()f(xN(0,  uK(())di  =  0,  *  =  0, 1, ....  N, 

or  using  matrix-vector  notation, 

N 

Or  i  =  0,l,...,TV. 

3=0 

The  Galerkin  differentiation  matrix,  DL,  and  RHS  vector,  cL  are  defined  as 

=  f\ 

At  f1 

for  i,j  =  0,1,...,  TV. 

Using  LGL  quadrature,  DL  can  be  calculated  with  the  relationship 

Q 

Df-  =  J  =  0, 1,  • '  • ,  N, 

k= 0 
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where  {wk}^=0  are  the  LGL  weights  given  by  Equation  (2.49).  Again,  since  LGL  quadra¬ 
ture  rule  is  exact  for  polynomial  integrands  of  degree  less  than  or  equal  to  2TV  —  1,  the 
numerical  integration  is  done  exactly  when  Q  =  N  LGL  integration  points  are  used. 

Using  LGL  quadrature  rule,  cL  can  be  approximated  by  the  relationship 

Af  Q 

ci  ~  Y uN(tk))wk,  i  =  0, 1, . . . ,  TV. 

k= 0 

When  Q  =  TV  LGL  quadrature  points  are  used,  the  RHS  vector  approximation,  c%,  can  be 
expressed  in  the  simplified  form 


-Ni 


At  N 

=  T  !>(&)/(* 

k= 0 


Nk  - Nk 


)wk,  *  =  0,1,...,  TV. 


Remark  2.11.  Again,  if  Q  =  {N  +  1)  integration  points  are  used,  the  RHS  vector  will 
integrate  exactly  when  f(x(t  ),  u(t)  )  is  linear  in  x  and  u.  If  f  is  a  nonlinear  function, 
accuracy  of  integration  may  be  improved  by  increasing  the  number  of  quadrature  points  Q. 

When  Q  =  N  LGL  quadrature  points  are  used  to  calculate  DL  and  cf  ,  the  system 
may  be  simplified  as 


N 


j= 0 


0,  *  =  0, 1, . . , ,  N. 


(2.66) 


A  feasible  solution  to  the  equality  dynamical  constraint  may  not  exist.  In  order  to  guarantee 
feasibility  of  the  discretized  problem,  the  following  inequality  constraint  is  suggested, 


N 

3= 0 


-Ni 


<SN,  *  =  0,1,...,  TV, 


where  5N  is  the  feasibility  tolerance  that  is  dependent  on  N  and  the  smoothness  of  x  and 
u  (see  Chapter  6).  The  initial  conditions  and  endpoint  conditions  may  be  approximated 
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similarly  by 


ll®JV°“a;o||oo  <  $N  and  <5*. 

It  is  clear  that  in  the  Petrov-Galerkin  formulation  (2.66),  the  differentiation  matrix, 
Dl,  and  RHS  vector,  ,  do  not  simplify  as  cleanly  as  given  for  the  Bubnov-Galerkin 
formulation  (2.60).  This  inevitably  will  have  negative  effects  on  computational  efficien¬ 
cies.  However,  casting  the  problem  in  the  Petrov-Galerkin  numerical  form  will  have  nice 
consequences  when  applied  to  Galerkin  optimal  control,  as  will  be  shown  in  Chapter  6. 

2.3.2.  Spectral  Element  Methods 

Spectral  element  methods  are  local  (elemental)  applications  of  spectral  methods. 
They  combine  the  flexibility  of  finite  elements  with  the  accuracies  associated  with  spectral 
methods.  This  element-based  numerical  approach  is  advantageous  due  to  its  ability  to 
handle  complicated  geometries  and  can  be  easily  formulated  for  adaptive  strategies  [16, 
59].  In  this  section,  the  focus  will  be  on  two  Galerkin  formulations,  continuous  Galerkin 
and  discontinuous  Galerkin  element-based  methods.  Continuous  Galerkin  techniques  were 
first  applied  to  ordinary  differential  equations  (ODEs)  in  1972  by  Hulme  [14,  15]  and  a 
study  of  global  error  control  was  done  by  Estep  et  al.  [60]  in  1994.  The  first  analysis  of 
discontinuous  Galerkin  methods  applied  to  ODEs  was  done  in  1974  by  Reed  et  al.  [61] 
and  an  adaptive  error  control  technique  was  used  by  Bottcher  et  al.  [62]  in  1997.  More 
recently,  multi-adaptive  continuous  Galerkin  and  discontinuous  Galerkin  techniques  have 
been  studied  by  Logg  and  presented  in  a  series  of  papers  [63-65].  The  general  structure  of 
these  element-based  Galerkin  methods — as  well  as  the  mathematical  notation  used  in  this 
dissertation — is  provided  by  Giraldo  [44]  and  discussed  in  the  following  sections. 

2.3.2. 1.  Continuous  Galerkin 

Consider  a  continuous  element-based  Galerkin  approach  to  discretizing  (2.52).  Again, 
for  this  discussion,  the  LGL  node  structure  will  be  used  for  the  problem  discretization.  In 
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this  approximation,  the  weak  integral  form  of  (2.52)  inside  each  element,  Oe,  takes  the 
form  [44] 


[  #N(i)  «<'>"(()))  dt  =  0,  (2.67) 

J  ne 

for  e  =  1,2,..-..,  iVe  and  i  =  0, 1, . . . ,  N,  where  O  =  IJ^i  defines  the  total  domain. 
The  state  trajectory,  x(t),  is  approximated  inside  each  element,  .  by  interpolating  iV-th 
order  Lagrange  polynomials,  (t)}jL0,  at  the  nodes  0  by  the  relationship 

N 

x^N(t)  =  (t)x&Nj , 

3=0 

for  e  =  1,  2, ... ,  Ne,  where  {t^}jLQ  are  the  LGL  nodes,  {£j}f=0,  mapped  back  to  the 
physical  space  inside  each  element,  Vte.  Also,  let  uN(t)  be  an  interpolating  function  of 


N 


U 


(e)N 


(t)  = 


3=0 


where  {?//e  (t)}jL0  are  any  set  of  continuous  functions  (not  necessarily  polynomials)  with 


the  property  =  Sij.  Therefore  x('e')Nj  =  x^N(tf^),  for  e  =  1,2,  ...,iVe  and 


(eh 


j  —  0, 1, . . . ,  N,  and  similarly,  u^N3  =  u^N(t^).  The  relationship  between  the  physical 


time  domain,  t  e  [to,  tf]  = 

by  [44] 


41)  ANe) 
Lo  )  ln 


,  and  the  computational  space,  £  G  [—1, 1],  is  given 


£  = 


At(e)  V  f 


(e) 


1  and  dt;  = 


Af(e) 


dt, 


and  conversely, 


A  Ae) 


(e) 

0 


A  t(e) 

and  dt  =  — - — d£, 
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where  A  =  tff  —t^  is  the  size  of  each  element,  Qe,  which  can  be  nonuniform  in  length. 
The  Lagrange  polynomial  defined  on  the  LGL  computational  domain  is  given  by 


j= 0^ 

The  state  trajectory,  x,  can  now  be  approximated  inside  each  element,  Oe,  by 

N 

3=0 

where  {<^(£)}>=0  are  the  Lagrange  polynomials  defined  on  the  LGL  grid.  Likewise,  uN(£) 
is  given  by 

N 

u^NK)  =  K)s‘-w, 

3=0 

where  V'f  (6)  = 

Remark  2.12.  In  this  formulation  x^NN  =  yA+iyvo  and  ul'e'>NN  =  u^e+1^N0,  far  e  = 
1.2. ....  Ne  —  1.  This  continuity  condition  is  a  consequence  of  the  global  formulation  of 
the  problem  discussed  in  Remark  2.13. 


In  the  computational  domain,  £,  the  system  becomes 


b1  Afe)  r1 

/  0fK)/(x(e)''K),«(e)NK))<i«  =  O, 


'-1 


'-1 


for  e  =  1,  2, . . . ,  Ne  and  i  =  0, 1, . . . ,  N,  and  in  terms  of  the  approximating  polynomials 
becomes 


Y  f'  ^  f  0,'v/(^(')N(e).Mw'vK))<ie  =  o. 


3=0 


1-1 


1-1 
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In  matrix-vector  notation,  our  system  can  be  expressed  as 


N 

J2Dl?*(e>N;i  -  4*  =  o, 

3=0 

for  e  =  1,  2, . . . ,  Ne  and  i  —  0, 1, . . . ,  N .  The  local  element  (N  +  1)  x  (N  +  1)  Galerkin 
differentiation  matrix,  Die\  is  defined  as 

dV  =  f1  </>f  (O0f  (ZW,  i,j  —  0,1, ...  ,N.  (2.68) 

Using  LGL  quadrature,  l)le> ,  can  be  calculated  with  the  relationship 

Q 

Di?  =  '52<l>ir(£k)<i>?(£k)v)k  =  <i>?(£i)wi  =  Aijwi,  i,j  —  0,1, ...  ,  N,  (2.69) 
k= o 


where  {wi}^L0  are  the  LGL  weights  given  by  Equation  (2.49)  and  A  is  the  Legendre  PS 
differentiation  matrix  (2.57).  Since  LGL  quadrature  rule  is  exact  for  polynomial  integrands 
of  degree  less  than  or  equal  to  2N  —  1,  the  numerical  integration  is  done  exactly  when 
Q  =  N  LGL  integration  points  are  used.  If  Q  —  N  LGL  quadrature  nodes  are  used,  the 
approximation  to  the  (N  +  1)  x  1  RHS  vector  simplifies  to 


~(e)m 


A 


f(x 


for  e  =  1,2 ,...  ,Ne  and  i  —  0, 1, . . . ,  N. 

Remark  2.13.  So  far,  the  required  objects  have  been  identified  to  solve  the  system  numer¬ 
ically  with  element-based  Galerkin.  However,  since  nodal  basis  functions  are  continuous 
across  element  boundaries  and  LGL  nodes  include  both  endpoints,  a  global  solution  to  our 
problem  can  be  found.  To  do  this,  a  global  assembly  or  direct  stiffness  summation  can  be 
done,  where  the  direct  stiffness  summation  operator  is  f\^lv  [44] 
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The  global  equations  to  the  problem  become 

Np 

Y2DuxNpJ  -cNpl  =  0,  1  =  1,...,  Np.  (2.70) 

j= i 

The  global  Np  x  Np  Galerkin  differentiation  matrix,  I)u,  and  RHS  vector,  cNpI ,  are  then 
defined  by 

Ne  Ne 

Du  =  [\D(^  and  cN'J  =  /\c(e)7V\ 

e=l  e=l 

where  Np  =  ( NeN  +  1)  is  the  total  number  of  grid  points.  Note  that  the  direct  stiffness 
summation  operator,  /\^lv  does  the  mapping  (i,  e),  (j,  e )  — >•  I,  J  [44].  So  for  the  local 
differentiation  matrix  and  RHS  vector 

w(e)  Jfi) 
u00  U'01 

d (e)  d ^ 
u10  U11 

d d ^ 

UN0  aNl 

the  direct  stiffness  summation  operations  Du  =  f\  D^f  and  cNpI  =  f\  Se) Nl  result  in  the 

e=l  e=l 

global  Np  x  Np  differentiation  matrix  and  Np  x  1  RHS  vector, 
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In  order  to  guarantee  feasibility  of  the  discretized  problem,  the  following  inequality 


constraint  is  suggested. 


j= o 


<  6N\  I 


0,1,-..,  Np, 


where  SNp  is  the  feasibility  tolerance  that  is  dependent  on  Np  and  the  smoothness  of  x  and 
u  (see  Section  5.2).  The  initial  conditions  and  endpoint  conditions  may  be  approximated 
similarly  by 


H^JVpO  _  Xo||^  <  sNp  and  ||e(xJVpJVp)||oo  <  SNp. 

Note  that  although  the  discretization  for  Problem  (2.67)  is  element-based,  the  con¬ 
tinuous  Galerkin  formulation  (2.70)  is  global  in  nature.  This,  however,  is  not  the  case  for 
the  discontinuous  element-based  Galerkin  approach. 

2.3.2.2.  Discontinuous  Galerkin 

Consider  a  discontinuous  element-based  Galerkin  approach  to  discretizing  (2.52). 
Again,  for  this  discussion,  the  LGL  node  structure  will  be  used  for  the  problem  discretiza¬ 
tion.  In  this  approximation,  the  weak  integral  form  of  (2.52)  inside  each  element,  Qe,  takes 
the  form  [44] 


[  4e)N(t)  0(e)iV(f)  -  f(x^e)N(t),  u^N(t)))  dt  =  0, 

Jfie 

for  e  =  1,  2 ,Ne  and  i  =  0, 1, . . . ,  N,  where  f2e  defines  the  total  domain. 

The  state  trajectory,  x(t),  is  approximated  inside  each  element,  Qf  .  by  interpolating  iV-th 
order  Lagrange  polynomials,  {0je)iV(f)}^o,  at  the  nodes  {t(p  }jC0  by  the  relationship 

N 

x^N(t)  =  ^(j)fN(t)x<<e')Nji 

3=0 
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for  e  =  1,  2, ,  Ne,  where  {t^}jL0  are  the  LGL  nodes,  {£j}jL0,  mapped  back  to  the 
physical  space  inside  each  element,  Vte.  Also,  let  uN (t)  be  an  interpolating  function  of 

{uNj}U 


N 


U 


'•>«(«)  =  y,4)n  (t)u( 


(e)N(  .s ~(e)Nj 


3=0 


where  {?Ae 'N(t)}jL0  are  any  set  of  continuous  functions  (not  necessarily  polynomials)  with 


the  property  ip^N \u)  =  dlr  Therefore  x{e)Nj  _  x^N(t[p),  fore  =  1,2,...,  Ne  and 


(e)'l 


j  =  0, 1, . . . ,  N,  and  similarly,  u(e>Nj  =  u^N (t^).  The  relationship  between  the  physical 


time  domain,  t  e  [t0,  t/]  = 

by  [44] 


Ai)  ANe) 
c0  )  ln 


,  and  the  computational  space,  £  G  [—1, 1],  is  given 


1  and  di £  = 


A  f(e) 


dt, 


and  conversely, 

( \  Afte) 

t  =  — (f  +  1)  +  fo  and  dt  =  -ij—dt.i 

where  A =  t$  —t^  is  the  size  of  each  element,  Oe,  which  can  be  nonuniform  in  length. 
The  Lagrange  polynomial  defined  on  the  LGL  computational  domain  is  given  by 

0f(O  =  n^Ayr.  i  =  0, ....  TV. 

j¥=i 

The  state  trajectory,  x,  can  now  be  approximated  inside  each  element,  Qe,  by 


x 


(e)N(t\  _ 


N 


(0  =  zJ>j 

3=0 
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where  {<Af(0}yLo  are  the  Lagrange  polynomials  defined  on  the  LGL  grid.  Likewise,  uN(£) 
is  given  by 

N 

»(e)N(a  = 

3=0 


where  —  $ij-  In  the  computational  domain,  £,  the  system  becomes 


f1  ,  A  Ae) 


4t(Of&')N  K).«(e)"  «))<*«  =  0, 


'-1 


for  e  =  1,2,...,.  iVe  and  i  =  0, 1, . . . ,  N.  Integration  by  parts  on  the  first  term  yields  the 
weak  form  relationship 


- 1 

for  e  =  1,  2, . 
we  have 


Ms(m  +  [^(o^K)]!.,  J\f(OfW)N(o,^e>NKm=o, 

. . ,  Ne  and  i  =  0,1 , ,N,  and  in  terms  of  our  approximating  polynomials 


N 


+  v  [<(«^(0] 

j—o 


1 

-1 


A 

2 


^(o/m>N(o,ti(e)NKm=o. 


Remark  2.14.  With  the  discontinuous  element-based  Galerkin  approach,  we  let  x,  u  and 
the  basis  functions  be  discontinuous  across  element  edges.  A  numerical  flux  term  x!'*]  acts 
as  a  jump  condition  between  elements  [44].  Here,  we  consider  the  centered  flux  relation¬ 
ship,  x =  \  ( x ^  +  x^),  proposed  by  Delfour  et  al.  [66],  where  e  and  q  denote  the 
element  and  its  neighbor,  respectively. 
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Integrating  by  parts,  yet  again,  results  in  the  Galerkin  strong  form  relationship 


A 


<P?(Of(x(e)N  K).«w''K))ie  =  o, 


'-1 


for  e  =  1,2,...,  A/g  and  i  =  0,1,...,  iV.  Since  LGL  nodes  are  used,  the  boundary  term, 
rfe\  may  be  simplified  as 


f  |  (x^N0  -  x^NN)  ,  i  =  N, 
(o,  i^N, 


(Ne) 

Vi  = 


1  ^(Ne)N0  _  %( Ne-1)NN j  ,  i  _  0< 

0,  *7^0, 


for  elements  Oe  =  fix  and  Q,ve,  respectively,  and  for  each  other  element  (Oe  ^  Ol5  O^) 
we  have 


1  ^(e)JVO  _  ^e-l)NN^  , 
^(e)ATAT)  , 


^(6)  =  <  i  (^(e+Divo 


0, 

N, 


0,  i  7^  0,  A. 


In  matrix-vector  notation,  our  system  may  be  expressed  as 


N 

£ 

3=0 


D 


(e) -(e) 


y  a 


a;;-'  +  7), 


(e) 


-  A  =  o, 


for  e  =  1,  2, . . . ,  Ne  and  i  =  0,1, ...  ,N,  where  the  local  element  (A  +  1)  x  (A  +  1) 
Galerkin  differentiation  matrix,  I)(e} ,  is  the  same  as  that  defined  in  (2.68)  and  (2.69).  If 
Q  =  A  LGL  quadrature  nodes  are  used,  the  approximation  to  the  (A  +  1)  x  1  RHS  vector 
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simplifies  to 


d(e)m 


A  6e) 


f(x{e)N\uie)Ni)wh 


for  e  =  1,2 , . . .  ,Ne  and  i  =  0, 1, . . . ,  N .  The  local  discontinuous  formulation  therefore 
becomes 

N 

^  ~  c(e)Ni  =  0,  (2.71) 

j'=o 

for  e  =  1, 2 , ,Ne  and  i  =  0, 1, . . . ,  N.  In  order  to  guarantee  feasibility  of  the  discretized 
problem,  the  following  inequality  constraint  is  suggested. 


N 

^D^x(e)Nj  +  r)le) 


i=o 


y 


~{e)Ni 


<  5 


N 


for  e  =  1,  2 , ,Ne  and  i  =  0, 1, . . . ,  N,  where  5N  is  the  feasibility  tolerance  that  is 
dependent  on  N  and  the  smoothness  of  x  and  u  (see  Section  5.3).  The  initial  conditions 
and  endpoint  conditions  may  be  approximated  similarly  by 


x^N0-x0  <6n  and  e(x^NN)  <  6N . 

^  OO  —  v  7  OO  — 


Note  that  unlike  the  continuous  element-based  Galerkin  approach  that  can  easily  be 
formulated  for  a  global  solution  (2.70),  the  discontinuous  Galerkin  formulation  (2.71)  is 
purely  local  in  nature.  The  communication  between  elements  is  done  only  by  the  boundary 
term,  'q(e\  It  is  therefore  easy  to  see  that  the  discontinuous  Galerkin  formulation  is  easy  to 
parallelize  for  computational  efficiency.  Additionally,  the  flexibility  and  discontinuous  na¬ 
ture  of  the  formulation  lends  itself  to  problems  with  complex  geometries  and  discontinuous 
solutions. 

Global  spectral  method  techniques,  specifically,  collocation  (or  PS  methods)  have 
become  the  method  of  choice  for  discretizing  system  dynamics  in  a  number  of  direct  meth- 
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ods  for  optimal  control,  such  as  the  Legendre  PS  method.  Element-based  collocation  tech¬ 
niques  have  also  been  investigated  for  the  use  in  direct  methods  for  optimal  control  by  Ross 
et  al.  [12].  These  methods  have  gained  attention  due  to  their  flexibility  as  well  as  compu¬ 
tational  efficiency.  In  Chapter  5,  we  will  further  investigate  the  use  of  the  element-based 
Galerkin  formulations  for  optimal  control.  However,  we  will  first  consider  additional  mo¬ 
tivation  for  the  use  of  the  weak  integral  formulation  in  Chapter  3.  This  will  lead  to  the 
creation  of  a  family  of  Galerkin-based  formulations  called,  Galerkin  optimal  control. 
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CHAPTER  3: 

MOTIVATION  FOR  GALERKIN  OPTIMAL  CONTROL 


PS  methods  for  optimal  control  have  been  shown  to  be  good  all-round  methods  for 
solving  nonlinear  optimal  control  problems.  In  particular,  the  Legendre  PS  method  has 
gained  much  attention  in  recent  years.  As  mentioned  previously,  two  highlights  are  the 
successful  use  of  Legendre  PS  method  for  the  first  ever  zero-propellant  attitude  maneuver 
of  the  International  Space  Station  [4]  and  the  first  ever  minimum-time  rotational  maneuver 
performed  in  orbit  by  a  NASA  space  telescope  called  TRACE  [8].  The  formulation  of  the 
Legendre  PS  method  is  provided  in  Section  3.2,  but  first  consider  the  following  problem  of 
optimal  control. 

3.1.  Problem  B  (Bolza  Problem) 

Determine  the  state-control  function  pair,  t  (->•  (x,  u)  e  WNx  x  MA?n,  that  minimizes 
the  cost  functional 

J  —  J  F(x(t),  u(t))dt  +  E(x{— 1),  x(l)),  (3.1) 

subject  to  the  dynamics 


x(t)  =  f(x{t),u{t)), 


(3.2) 


endpoint  conditions 


e(x(-!),x(l))  =  0, 


(3.3) 
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and  mixed  state-control  path  conditions 


h(x(t),u(t ))  <  0.  (3.4) 

It  is  assumed  that  F  :  RNx  x  RNu  — »  R,  E  :  RNx  x  RNx  — »  M,  /  :  x  — >■ 

e  :  M'Vx  x  — >■  h  :  RNx  x  — y  RNh  are  Lipschitz  continuous  with  respect  to 
their  argument.  It  is  also  assumed  that  an  optimal  solution  (x*(-),  «*(•))  exists.  Additional 
assumptions  related  to  the  smoothness  of  x*(-)  and  /./*(■)  are  provided  in  the  feasibility  and 
consistency  theorems  included  in  Chapters  4,  5  and  6. 

3.2.  Legendre  Pseudospectral  Method 

In  the  Legendre  PS  method  approximation  to  Problem  B,  the  states  are  approxi¬ 
mated  with  globally  interpolating  iV-th  order  Lagrange  polynomials  defined  on  LGL  grid, 
{tj}jL0.  Recall  that  the  LGL  points  are  defined  by  t0  =  —1  <  ti  <  ■  ■  ■  <  tN  =  1  and  are 
the  roots  of  Equation  (2.42).  The  state  trajectory,  x(t),  is  approximated  by 

N 

x(t)  ~  xN(t)  =  (t)xNj . 


xNj^x(tj),  j  =  0,1,...,  A, 

and  similarly,  uNj  ~  u(t  j  ).  The  Lagrange  polynomials,  {(pfjjLo,  of  order  N,  are  given  by 
Equation  (2.32),  and  have  the  property,  ^ (L)  =  8^,  for  i,j  —  0, 1, . . . ,  N. 

In  the  Legendre  PS  method,  a  solution  to  the  differential  equation  x  —  f(x,  u)  =  0 
may  be  approximated  at  the  LGL  nodes  with  the  following  formulation 

N 

~  ™Ni)  =  °>  *  =  0, 1  (3.5) 

j= o 
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since,  the  derivative  of  xN(t)  at  each  LGL  point  is  exactly  equal  to 


N 

xN(ti )  =  y ^AjjxN\ 

3=0 

for  any  polynomial  with  degree  less  than  or  equal  to  N  [46],  where  A  is  the  Legendre 
PS  differentiation  matrix  (2.57).  A  feasible  solution  to  the  equality  dynamical  constraint 
may  not  exist.  In  order  to  guarantee  feasibility  of  the  discretized  problem,  Gong  et  al.  [5], 
suggest  the  following  inequality  constraint, 

N 

J2A^Nj  -  /(^>  um)  <  <*",  *  =  0, 1 

f=°  oo 

Remark  3.1.  Note  that  SN  is  the  feasibility  tolerance  and  is  dependent  on  N  and  the 
smoothness  ofx  and  u.  For  x  G  Wm,(X  (see  Appendix  C),  m>  2  and  u  G  C°[— 1, 1],  it  has 
been  proven  by  Gong  et  al.  [5]  that  5N  =  (N  —  1)  i_m. 

The  endpoint  conditions  and  path  constraints  are  approximated  similarly  by 

\\e(xN0,xNN)\L<6N, 

h(xNi,uNi)  <  SN  ■  1,  i  = 

where  1  denotes  [1, . . . ,  1]T.  Lastly,  the  cost  functional  J[x(-),u(-)]  is  approximated  by 
LGL  quadrature  rule, 

N 

J[x(-),u(-)}  «  JN(xN,uN)  =  ^2F(xm,um)wi  +  E(xno,xnn), 

i= 0 

where  xN  =  xN1 , . . . ,  iM] ,  uN  =  [u^0,  m^1,  . . . ,  «M]  and  {wi}^L0  are  the  LGL 

weights  (2.49)  associated  with  the  LGL  points,  To  allow  for  a  practical  search  area 
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for  the  optimal  solution  the  following  constraints  are  added 

{xNi  e  x,uNi  eu,  i  =  o,  l, ,  N}, 

where  X  and  U  are  the  search  regions  that  contain  the  optimal  solution  of  the  discretized 
nonlinear  optimization. 

The  resulting  optimization  problem  can  be  solved  using  existing  NLP  algorithms.  A 
feasible  solution  can  be  found  that  satisfies  the  tolerances  specified  in  the  NLP  by  adjusting 
the  order  of  polynomial  used  in  the  approximation.  The  theoretical  underpinnings  of  the 
Legendre  PS  method  have  been  studied  in  great  detail  over  the  last  two  decades.  Theorems 
for  feasibility,  consistency  and  convergence  of  the  Legendre  PS  method  approximations 
can  be  found  in  [3,  5,  6,  67,  68].  Although  the  Legendre  PS  method  has  been  shown  to 
produce  accurate  solutions  on  a  wide  variety  of  optimal  control  problems,  it  has  proven  to 
be  a  challenging  task  to  modify  this  method  to  efficiently  solve  multi-scale  problems,  one 
for  which  the  state(s)  and  control(s)  evolve  at  different  timescales.  An  example  of  such  a 
problem  is  given  next. 

Example  3.1.  Consider  the  following  boundary  value  problem  given  by  Williams  [69]  of 
minimizing  the  cost  function 


subject  to  the  dynamics 

x\(t)  =  X2  and  X2  (t)  =  C  sin(kt)  +u,  (3.7) 

and  with  boundary  conditions 

xi(0)  =  0,  £2(0)  =  0,  xi(tf)  =  1  and  £2  (tf)  =  0.  (3.8) 
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The  analytic  solution  to  this  problem  is  given  by 


,  ,  C  .  „  s  t3  t2  C 
xi (t)  =  -—sm (kt)  -  ci—  -  c2  —  +  —t, 
kz  o  2  k 

,s  C  „  ,  t 2  C 

W)  =  --rcos(kt)  -cl—-c2t+  — , 
k  2  k 


u(t)  =  —C\t  -  c2, 


(3.9) 

(3.10) 

(3.11) 


obtained  via  Pontryagin’s  maximum  principle.  The  constants  are  defined  as 

C  { C  C\  12  (  C  C  \ 

d  =  (^-cos (ktf)  +  psin(%)  -  jtf)  >  (3-12) 

c2  =  ^  (j^cos(ktf)  ~CP)  +  ^(1  +  -  jtf'j  ,  (3.13) 

£/  =  10,  C  =  0.1  and  k  =  8.  This  problem  was  solved  using  the  Legendre  PS  method 
with  optimality  and  feasibility  tolerances  of  5  x  10-5  and  5  x  10-4,  respectively.  The 
exact  solution  was  used  as  an  initial  guess.  Figure  12  shows  the  Legendre  PS  method 
approximations  of  order,  IV  =  50. 
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Figure  12:  Exact  solution  and  Legendre  PS  method  approximation  with  N  =  50  for  Exam¬ 
ple  3.1. 


From  Figure  1 2,  it  is  apparent  that  and  x2  evolve  on  different  timescales.  This 
is  continued  by  viewing  the  Legendre  spectral  coefficients  of  Xj  and  x2  presented  in  Fig¬ 
ure  13.  Note  the  difference  in  magnitude  of  the  x\  and  x2  Legendre  spectral  coefficients, 
particularly  between  n  =  5  and  40. 
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(a)  Legendre  spectral  coefficients  for  state  x\. 


(b)  Legendre  spectral  coefficients  for  state  x2. 


(c)  Legendre  spectral  coefficients  for  control  u. 


Figure  13:  Legendre  spectral  coefficients  for  x\,  x2  and  u  for  Example  3.1. 


The  difference  in  evolution  of  the  and  x2  system  dynamics  suggests  that  problem 
(3.6) — (3.8)  may  be  approximated  more  efficiently  using  a  multi-scale  numerical  technique, 
where  slow  state,  x\,  and  fast  state,  x2  are  discretized  on  different  timescales.  Consider  the 
following  general  multi-scale  optimal  control  problem,  in  which  the  slow  and  fast  states. 
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xs(t)  and  Xf(t),  are  associated  with  the  slow  and  fast  dynamics,  respectively.  This  modified 
Problem  B  is  presented  as  Problem  B. 

3.3.  Problem  B  (Multi-scale  Bolza  Problem) 

Problem  B.  Determine  the  state-control  function,  t  (xs,Xf,u)  €  RNxs  x  RNxf  x  M;V” , 
that  minimizes  the  cost  functional 

J[xs(-),xf(-),u(-)}  =  I  F(xs(t),xf(t),u(t))dt  +  E(xs(-l),xa(l),Xf{-l),Xf{l)), 

subject  to  the  dynamics, 


Xs(t)  =  f(xs(t),Xf(t),u(t)), 
Xf(t)  =  g(xs(t),xf(t),u(t))f 


endpoint  conditions, 


e(xs(-l),xs(l),xf(-l),xf(l))  =  0, 

and  mixed  state-control  path  conditions, 


h(xs(t),xf(t),u(t))  <  0. 


It  is  assumed  that  F: 


D  Nx 


X 


i>iVa 


lNx°  x  RNxf  x  RNu  — >•  R,  e: 


lNx*  x  RN*°  x  RNxf  x  RNxf  ->■ 


[ )NX 
\>Ne  u.  JU)NX 


f  x  RNu  -+R,E:  Rn*°  x  RNxs  x  MiVx/  x  — >•  M,  /: 


%/r: 


xJ 


/  x  -)• 


are  Lipschitz  continuous  with  respect  to  their  argument.  It  is  also  assumed  that  an 
optimal  solution  (x*s(-)iX*f(-),u*{-))  exists. 


A  number  of  methods  have  been  investigated  for  solving  multi- scale  problems 
such  as  Problem  B,  by  casting  the  slow  and  fast  dynamics  of  the  problem  onto  differ¬ 
ent  timescales.  Recently,  Desai  et  al.  [70]  and  Williams  [69]  provided  varied  techniques. 
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While  Desai  et  al.  use  an  elemental  approach  where  fast  and  slow  dynamics  are  treated 
with  similar  order  polynomials  within  different  size  subintervals  and  Williams  uses  a  tech¬ 
nique  where  the  slow  dynamics  are  approximated  with  a  weak  formulation.  Additionally, 
in  [71],  Gong  et  al.  investigate  the  use  of  a  Tau-like  method  to  discretize  the  slow  dynamics, 
after  discounting  a  straightforward  modified  Legendre  PS  method  approach.  This  modified 
Legendre  PS  method  will  be  presented  next  for  discussion  purposes. 

3.4.  A  Modified  Legendre  PS  Method  for  Multi-scale  Problems 

Consider  the  following  modified  Legendre  PS  method  approach  to  solving  Prob¬ 
lem  B.  The  states  and  controls  are  approximated  with  globally  interpolating  Lagrange 
polynomials  on  different  LGL  timescales.  The  slow  state,  xs(t),  is  approximated  on  sparse 
grid  {Tj}jL0  while  the  fast  state,  xj(t),  on  dense  grid  {tj}f=0,  where  M  <  N.  The  slow 
and  fast  states  are  defined  by  the  following  approximating  polynomials 

M 

Xs{t)  «  xf{t)  =  (t)xf3, 

3=0 

N 

Xf(t )  «  xj (t)  =  ^2(j)f(t)Xf3, 

3=0 

where  the  Lagrange  polynomials  {ff1  (t)}jLo  and  {of  are  defined  on  grids  {rj}jL0 

and  {tj}jL0,  respectively.  Let 

xfJ^xs(Tj),  j  =  0, 
xJJ~Xf(tj),  j=0,l,...,N, 

and  similarly,  uNj  ~  for  j  =  0,1 , _ ,N. 

Remark  3.2.  For  simplicity,  the  control  variable,  u(t),  is  approximated  on  the  dense  grid 
{tj}jL 0,  however  this  need  not  be  the  case.  Modifications  may  be  made  to  this  method  to 
cast  the  control  onto  a  unique  grid,  such  as  sparse  grid  {fj}jL0,  where  M  <  N  [71]. 
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A  solution  to  the  differential  equations  xs  =  f(xs,Xf,u)  and  if  —  g(xs,Xf,u)  may 
be  approximated  by  discretizing  the  slow  dynamics  over  the  dense  grid  with  the  following 
formulation 


M 

3=0 

N 

EaNN-NJ 

Xf 

3=0 


f(xf\xf,um)  =  0,  i 
g(xf,xf,um)  =  0,  i 


0,1 

0,1 


where  ANN  is  the  standard  (N  +  1)  x  (N  +  1)  Legendre  PS  differentiation  matrix  (2.57) 
and  ANM  is  the  (N  +  1)  x  (M  +  1)  Legendre  PS  differentiation  transformation  matrix 
(2.39).  The  slow  state  approximation  projected  to  the  dense  grid,  ,  may  be  calculated  by 
the  linear  mapping  T^M  =  of  (t,  )  with  the  relationship 


~Ni  \^TNM-Mj 
^ s  /  j  a  ^ s  i 

3=0 


for  %  —  0, 1, . . . ,  N,  where  TNM  is  the  (N  +  1)  x  (M  +  1)  transformation  matrix  (2.37). 

Remark  3.3.  Projecting  the  slow  dynamics  onto  the  dense  grid  provides  a  way  of  capturing 
the  high  frequency  information  of  the  fast  state  [71]. 

The  dynamical  constraints  therefore  become 

II  M  || 


3=0 


<SN ,  i  =  0,1,  ■  ■  ■ ,  N, 


N 

>  AN1-  T- 

3=0 


-g(xf,xf,um) 


<SN ,  i  =  0,1, . . . ,  N, 
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where  5N  is  the  feasibility  tolerance.  The  endpoint  conditions  and  path  constraints  are 
approximated  similarly  by 


h(xf\xf\uNi)  <8N  ■  1,  i  =  0, 1, . 


-,N. 


Lastly,  the  cost  functional  J[x(-),u(-)]  is  approximated  by  the  LGL  quadrature  rule, 


N 


J[x(-),m(')]  ~  JN(xf,xf,uN)  =  2_JF{xf\xf\uNl)wi  +  E(xf0,xfM,xf°,xfN), 


i= 0 


where  {wi}fL0  are  the  LGL  weights  (2.49)  associated  with  the  LGL  points,  {L}]g0  and 


xf  ,  xf  , 


r-MO  -Ml  -MMl  = N  r 
[■"s  1  t  ■  ■  ■  1  •L's  J  ?  **'/  [' 

and  uN  =  [uN0,  uN1, ,  uNN] 


_  r.T.^O  =JV1  ^iVTV] 


•^/  J 


To  allow  for  a  practical  search  area  for  the  optimal  solution  the  following  constraints  are 
included:  xf  exs,xf  G  X  j  and  uN  G  U,  where  Xs.  X  j  and  U  are  the  search  regions 
that  contain  the  optimal  solution  of  the  discretized  nonlinear  optimization. 

Example  3.1  (continued).  Consider  again  problem  (3.6)— (3.8)  solved  with  the  proposed 
multi-scale  Legendre  PS  method.  The  following  analysis  follows  that  given  by  Gong  et 
al.  in  [71].  Here  we  discretize  the  slow  state,  x\,  on  LGL  grid,  { t}  ,  and  fast  state, 
x2  and  control,  u,  on  LGL  grid  such  that  NXl  <  NX2.  This  problem  was  solved 

with  optimality  and  feasibility  tolerances  of  5  x  10-4  &  5  x  10  3,  respectively.  The  exact 
solution  was  used  as  an  initial  guess.  Figure  14  shows  the  visual  accuracy  of  the  multi-scale 
Legendre  PS  method  approximations  with  NXl  =  40,  NX2  =  50  and  Nu  =  50.  A  decrease 
in  the  approximation  order  of  the  slow  state  by  10  causes  a  significant  decrease  in  the 
accuracy  of  the  overall  approximation.  This  is  particularly  apparent  in  the  approximation 
of  the  control,  u,  in  Figure  14. 
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Figure  14:  Exact  solution  and  multi-scale  Legendre  PS  method  approximation  with  NXI  = 
40,  NX2  =  50  and  Nu  =  50  for  Example  3. 1 . 


Also  note  that  if  now  the  control,  u ,  is  cast  on  a  unique  LGL  grid  {fj}^0,  such  that  Nu  < 
NX2,  the  NLP  becomes  infeasible.  However,  if  u  is  cast  on  a  unique  LGL  grid  such  that 
Nu  >  NX2 ,  an  accurate  solution  is  obtained. 

Although  the  Legendre  PS  method  has  been  shown  to  produce  accurate  solutions  on 
a  wide  variety  of  optimal  control  problems,  approximating  the  derivative  of  a  function  using 
a  standard  PS  differentiation  matrix  may  introduce  errors  into  the  approximation.  This  may 
be  an  issue  when  using  a  multi-scale  approach  such  as  the  one  presented  in  Section  3.4.  It 
should  be  mentioned,  however,  that  the  Tau-like  method  of  Gonget  al.  [71]  produces  accu¬ 
rate  solutions  for  this  multi-scale  approximation.  However,  to  understand  what  happened 
with  the  straightforward  multi-scale  approach,  we  look  to  Jackson’s  Theorem. 
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3.5.  Jackson’s  Theorem 


Jackson’s  Theorem  allows  for  a  bounding  of  the  spectral  coefficients  (2.26),  (an}£l0, 
of  a  function,  H(t),  in  terms  of  the  function’s  derivative,  h(t). 

Lemma  3.1  (Jackson’s  Theorem).  [72]  Let  h{ff)  be  of  bounded  variation  in  [—1, 1],  Define 

H{£)  =  H{-1)  +  J*  h{s)ds, 

then  {an}™=0,  the  sequence  of  the  spectral  coefficients  of  11(f)  satisfies  the  following  in¬ 
equality 

kl  <  4=(^(A(0)  +  v(fc(0))-T, 

y/Tt  n6/z 

for  n  >  1  where  U(h(£))  is  the  least  upper  bound  of  |/i(£)|  and  V{h(f))  is  the  total 
variation  of  h(f  )  (see  Appendix  A). 

To  see  how  this  theorem  affects  a  PS  approximation  of  a  function’s  derivative,  let 
H  be  the  approximating  polynomial  error  of  a  function,  let  h  be  its  derivative,  and  let 
{an}ff=N+i  be  the  spectral  coefficients  of  H.  Jackson’s  Theorem  says  that  even  though 

OO 

an  may  be  very  small  (such  as  in  the  tail  of  the  spectral  coefficients  dropped  from 

n=N-\-l 

an  approximation)  the  error  in  the  approximation  of  the  derivative  may  be  relatively  large, 


an\^nV2<(U(h(0)  +  V(hm- 


This  factor  of  n3/2  could  potentially  add  unnecessary  errors  when  approximating  a  system’s 
dynamics  using  a  standard  differentiation  matrix. 


Remark  3.4.  This  idea  is  further  understood  by  considering  the  following  estimates  on  the 
approximation  of  any  function  fit)  G  H2  (see  Appendix  C).  Consider  (f  t  )  approximated 
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by  the  truncated  Legendre  series 


N 

PN(t )  = 

3=0 

where  Lj  is  the  Legendre  polynomial  of  order  j  and  {aj}jL0  are  spectral  coefficients  of  f. 
The  error  estimate  between  fit)  and  its  approximation,  pN(t),  is  given  by 

OO  OO  OO 

\\c{t) — p  (i) ii =  ajLj{t)  <  \aj\i 

j=N+ 1  LOO  j=N+l  j=N+l 

due  to  the  property  of  the  Legendre  polynomials  [46],  \Lj(t)\  <  1,  i  6  [—1, 1]  (see  Ap¬ 
pendix  A  for  definition  of  L°°-norm).  Additionally,  the  following  estimate  is  provided 
by  [46] 

||CW  -p"(()||t.  <  CiCoN-i,  (3.14) 

where  C\  is  a  constant  independent  of  N  and  Cq  =  V  (f('2>),  the  total  variation  off ^  (see 
Appendix  A ).  Now  consider  the  error  estimate  between  f  ( t )  and  its  approximation 

N 

PN(t )  =  'YjijLjit), 

3=0 

given  by 

OO  OO  ^  OO 

f(t)-pN(t)  =  M  Ljit)  <-  \ai\ i(i  +  1)i 

L°°  L L '  L°°  2  * — ' 

j=N+ 1  Loo  j=N+ 1  j=N+l 

due  to  the  property  of  the  Legendre  polynomials  [46],  Lfit )  <  \j(j  +  1),  t  G  [—1, 1]. 
Additionally,  the  following  estimate  is  provided  by  [46] 

C  (t)-pN(t)  <C3C2Nf  (3.15) 

L°° 
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where  C3  is  a  constant  independent  of  N  and  C2  =  (,  |  r2- N ,  the  Sobolev  seminorm  of 
(  ( defined  in  Appendix  C).  The  disparity  between  estimates  (3.14)  and  (3.15)  are  clear 
and  thus  pN  (t )  and  pN(t )  may  converge  at  different  rates.  In  fact,  while  estimate  (3.14) 
proves  convergence  of  pN  (t),  estimate  (3.15)  shows  that  pN(t)  may  not  converge  at  all. 
This  disproportionate  convergence  behavior  may  have  compounding  effects  when  using 
the  multi-scale  approach  for  optimal  control  (introduced  in  Section  3.4). 

Example  3.1  (continued).  Consider  again  problem  (3.6)— (3.8)  solved  with  the  proposed 
multi-scale  Legendre  PS  method  and  NXl  =  40,  Nx 2  =  50  and  Nu  =  50.  If  now  the 
optimality  and  feasibility  tolerances  are  relaxed  and  decreased  to  5  x  10  2  and  5  x  10_1, 
respectively,  the  NLP  constraints  are  satisfied  and  an  accurate  approximation  of  the  states 
and  control  is  obtained.  In  the  context  of  Jackson’s  Theorem  and  Remark  3.4,  this  should 
not  be  a  surprise.  From  Figure  13,  the  Legendre  spectrum  of  the  dropped  x\  modes  consist 
of  coefficients  with  magnitudes  of  O(10-3).  In  fact  with  the  lower  optimality  and  feasibility 
tolerances,  the  multi-scale  Legendre  PS  method  can  now  produce  accurate  solutions  for 
lower  order  approximations  of  x \ ,  such  as  NXl  =  10  (with  NX2  and  Nu  =  50).  However,  if 
a  reduction  in  the  control  approximation  is  also  the  goal  such  as,  Nu  <  40  (with  NX1  =  10 
and  NX2  =  50),  a  further  reduction  in  the  optimality  and  feasibility  tolerances  are  required 
in  order  to  satisfy  the  NLP  constraints. 

The  consequences  of  Jackson’s  Theorem  on  multi-scale  PS  methods  for  optimal 
control  are  significant.  Certainly,  the  class  of  problems  that  can  obtain  an  advantage  from 
this  approach  is  limited.  In  general  we  can  only  hope  to  benefit  from  multi-scale  PS  when 
reducing  the  polynomial  order  of  system  variables  that  have  extremely  small  Legendre 
expansion  coefficients  at  the  tail  of  the  spectrum.  This  will  require  us  to  use  a  different 
approach  if  we  hope  to  target  a  larger  class  of  optimal  control  problems.  Proposition  3.1 
highlights  the  advantage  of  an  alternate  method  of  discretizing  the  system  dynamics,  one 
in  which  the  derivative  of  higher  order  terms  does  not  disproportionally  add  to  the  overall 
error  of  the  approximation. 
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3.6.  Motivation  for  the  Weak  Integral  Formulation 


Proposition  3.1.  Let  Lj(t)  be  the  Legendre  polynomial  of  order  j.  Suppose 


OO 

^(t)  =  aJL^)> 

j=N+l 

OO 

j=N+l 


are  both  uniformly  convergent  on  [—1, 1],  then 


/I  OO 

Lft)eN  (t)dt  =  2  a,j, 

‘  1  A— AT  11 


j=N+ 1 

i-\-j  odd 


(3.16) 

(3.17) 


(3.18) 


for  all  0  <  i  <  N. 
Proof 


/I  OO  „1 

Li(t)eN (t)dt  =  a,j  /  Li(t)Lj(t)dt 

1  j=jV+l  ^-1 

OO  /  ,.i 

=  ai  (Li(t)Lj(t)  1-1  -  /  Lt(t)Lf  t)dt 

—  j\  r_Li  V  ^  —  1 


J'=JV+1 


Since  the  order  of  each  {Lj(t)}^0  is  less  than  Ar  and  the  order  of  each  {Lj(t)}^+1  is  bigger 
than  N,  the  orthogonality  of  the  Legendre  polynomials  implies 


Lft)Lj(t)dt 


0. 


Due  to  the  following  properties  of  the  Legendre  polynomial, 


Lk{  1)  =  1  and  Lfc(-1)  =  (-l)fc, 


Equation  (3.18)  follows. 


□ 
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Remark  3.5.  A  similar  result  is  found  for  the  case  that  Lagrange  interpolation  polynomials 
are  used  as  test  functions.  Let  0  be  the  Lagrange  polynomials  of  order  N,  defined 

on  grid  to  —  —1  <  ti, ... ,  tu-i  <  tjy  =  1.  Also,  let  Lfit)  be  the  Legendre  polynomial  of 
order  j;  let  rj(t)  and  /'/(  /;  )  be  defined  by  (3.16)  and  (3.17),  respectively.  Then 

/I  °°  rl 

<l>? (t)fi(t)dt  =  al  /  <f>i 

1  j=N+l 

=  jr  dj  f^(t)Lj(t)  l-i  -  /  . 

j=N+l  '  ' 

Since  the  order  of  the  polynomials  {fif  (t)}%L0  is  N  —  l  and  the  order  of  each  {Lj(t)}f=N+] 
is  bigger  than  N,  the  orthogonality  of  the  Legendre  polynomials  implies 

J  J>?{t)Lj{t)dt  =  0. 

Due  to  the  following  properties  of  the  Legendre  and  Lagrange  polynomials, 

(tk)  =  5ki,  Lk(l)  —  1  and  Lfc(-1)  =  (-l)fe, 


we  have 


<t>i  ( t)iK(t)dt  =  { 


E  %(-i),+1, 

j=N+l 

oo 

S  aji 

j=N+l 


i  =  0, 
i  =  N, 


0, 


i  f  0,  N. 
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Remark  3.6.  The  weak  integral  approximation  to  the  derivative  x ( t )  follows  from  Re 
mark  3.5.  Let 


N 

xN(t )  = 

3=0 

N 

xN(t)  =  ^ajLj{t). 

3=0 


Multiplying  x  by  a  test  function  <j)N(t )  then  integrating  over  the  domain  gives 


(t)x{t)dt 


<t>i  {t)x  ( t)dt+  /  (t)i  (t)dt, 


i -i 


'-i 


for  i  =  0,1,...,  iV,  where  the  residual  term,  eN(t),  and  its  derivative,  eN(t),  can  be  ex 
pressed  by 


OO 

j=N+l 

OO 

eN(t)  =  ajLj(t). 

j=N+ 1 


From  Remark  3.5,  the  weak  integral  residual  term  is 


<!>i{t)Lj(t)dt 


'  OO 

E  %(-i )j+\  i  =  0, 

j=N+ 1 

OO 

<  E  a3 ,  i  =  N, 

j=N +1 
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Therefore,  the  error  in  the  weak  integral  differentiation  term  may  be  bounded  as 


(t)x{t)dt 


'-i 


<t>f  {t)xN  (t)dt 


N, 


'-1 


(  oo 

X/  I a j  I  j 

j=N+l 


<  < 


X/  I a j  I  > 

j=N+l 

o, 


i  —  0, 
i  =  N, 
ifO,N. 


In  other  words,  the  accuracy  of  the  weak  integral  approximation  to  x  is  related  to  the 
Legendre  spectral  coefficients  of  the  dropped  modes.  If  the  Legendre  spectral  coefficients 
of  the  dropped  modes  are  negligible,  that  is 


N  - 

j=N+ 1 


where  5  <C  1,  then 


{t)x(f)dt 


(f>?  (t)xN  (t)dti 


for  i  =  0,1,...,  iV.  The  weak  integral  approximation  to  x  will  be  similar  to  the  accuracy 
of  the  approximation  to  x  itself  since 


L°° 


j=N+ 1 


oo 

ffoo  j=N  +1 


OO 

<  lai 

j=N+l 


Jackson’s  Theorem  (Lemma  3.1)  along  with  Remark  3.6  present  persuasive  argu¬ 
ments  for  the  use  of  the  weak  integral  approximation  (a.k.a.  Galerkin  methods)  in  place  of 
traditional  collocation  techniques  for  approximating  system  dynamics  in  direct  methods  for 
optimal  control.  We  will  see  that  Galerkin  methods  may  be  formulated  to  efficiently  solve 
multi-scale  problems  (see  Section  5.4).  As  a  preview,  Figure  15  shows  a  comparison  of  the 
exact  solution  to  Example  3.1  with  the  multi-scale  Galerkin  optimal  control  formulation 
numerical  solutions  with  NXl  =  3,  NX2  =  43  and  Nu  —  1.  This  problem  was  solved  with 
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optimality  and  feasibility  tolerances  of  5  x  10  4  and  5  x  10  3,  respectively,  and  the  exact 
solution  was  used  as  an  initial  guess. 


Figure  15:  Exact  solution  and  GOCM-MS  approximation  with  NX1  =  3,  NX2  =  43  and 
Nu  =  1  for  Example  3. 1 . 

Simulations  show  that  the  multi-scale  Legendre  PS  formulation  becomes  infeasible  for  the 
multi-scale  approximation  with  orders  NX1  =  3,  NX2  =  43  and  Nu  —  1  for  any  reasonable 
set  of  optimality  and  feasibility  tolerances  selected. 

Although  the  multi-scale  Galerkin  optimal  control  formulation  shows  promise  in 
solving  multi-scale  problems,  the  advantages  of  the  weak  integral  form  are  not  limited  to 
this  problem  set.  Additionally,  Galerkin  formulations  allow  for  the  weak  imposition  of 
boundary  conditions.  That  is,  end  conditions  may  be  enforced  only  up  to  the  accuracy  of 
the  approximation  itself.  Remark  2.9  highlights  this  property.  Galerkin  formulations  with 
weak  enforcement  of  boundary  conditions  have  been  shown  to  produce  improved  accu¬ 
racies  in  many  applications.  A  detailed  discussion  is  given  by  Canuto  et  al.  (see  Section 
3.7  of  [46]).  The  Galerkin  formulation  with  weak  imposition  of  end  conditions  may  also 
allow  for  problem  discretizations  with  other  than  LGL  points,  such  as  LGR  and  LG  (see 
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Sections  5.5.2. 1  and  5. 5. 2. 2,  respectively).  An  important  highlight  of  the  Galerkin  for¬ 
mulations  is  that  the  feasibility  and  consistency  theorems  are  proved  for  problems  with 
continuous  and/or  piecewise  continuous  controls  (depending  on  the  Galerkin  formulation). 

Lastly,  Galerkin  methods,  as  shown  in  Section  2.3.2,  may  be  easily  formulated 
as  element-based  methods,  both  continuous  and  discontinuous  (see  Sections  5.2  and  5.3, 
respectively).  These  element-based  formulations  may  have  benefits  in  approximating  so¬ 
lutions  to  optimal  control  problems  with  multiple  stages  or  those  with  discontinuous  solu¬ 
tions,  such  as  bang-bang  control  problems.  As  compared  to  global  methods,  these  element- 
based  techniques  may  be  formulated  to  require  less  computational  effort  and  memory.  Ad¬ 
ditionally,  the  discontinuous  Galerkin  formulation  may  advantage  from  parallel  computing. 
Chapter  4  will  introduce  a  new  numerical  technique  for  solving  nonlinear  optimal  control 
problems  founded  upon  the  Galerkin  methods  outlined  in  Section  2.3. 
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CHAPTER  4: 

GENERAL  GALERKIN  OPTIMAL  CONTROL  FORMULATION 


There  are  four  parts  to  the  numerical  solution  to  an  optimal  control  problem  using 
the  direct  method:  discretization  of  the  system  dynamics,  discretization  of  the  state-control 
constraints,  integration  of  the  cost  function  and  solving  the  NLP.  In  the  Galerkin  optimal 
control  approach  introduced  in  [73,  74],  we  use  Galerkin  techniques  to  discretize  the  sys¬ 
tem  dynamics  based  on  LGL  quadrature  nodes.  Recall,  that  the  LGL  points,  {tj}f=0,  are 
the  roots  of  Equation  (2.42)  and  therefore  include  the  endpoints,  t  =  ±1.  Thus  the  dis¬ 
cretization  works  in  the  interval  of  [—1, 1]  and  will  then  provide  the  framework  for  our 
problem  (e.g.,  the  state-control  constraints  will  be  discretized  at  these  nodes).  Recall  that 
LGL  quadrature  rule  will  provide  zero  error  for  polynomial  integrands  of  less  than  or  equal 
to  2N  —  1  [45].  Linally,  LGL  quadrature  rule  will  be  used  to  integrate  the  cost  function. 
The  resulting  optimization  problem  can  be  solved  using  existing  NLP  algorithms. 

In  addition  to  the  general  Galerkin  optimal  control  formulation,  this  chapter  con¬ 
tains  a  number  of  important  feasibility  and  consistency  results.  Theorems  4.1  and  4.2  prove 
that  nonlinear  program  Problems  GOCM-S  and  GOCM-S  (presented  in  Section  4.2)  have 
feasible  solutions  to  Problem  B,  where  controls  may  be  piecewise  continuous.  Additionally, 
Theorems  4.3  and  4.4  prove  that  the  general  Galerkin  optimal  control  numerical  approx¬ 
imation  is  consistent.  That  is,  nonlinear  programming  Problems  GOCM-S  and  GOCM-S 
are  consistent  approximations  to  the  continuous  optimal  control  Problem  B. 

4.1.  Method  for  Approximation 

In  the  general  Galerkin  optimal  control  approximation  to  Problem  B,  the  state  tra¬ 
jectory,  x(t),  is  approximated  with  globally  interpolating  N- th  order  Lagrange  polynomi- 
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als,  {(p^}^=0,  defined  on  a  grid  of  LGL  nodes,  {tj}^=0, 

N 

x(t)  «  XN(t)  =  (t)xNj . 

j= 0 


Due  to  the  property  of  the  Lagrange  polynomials,  <j)j(U)  =  Sij,  we  have 


'rNl=XN(tj),  .7=0,1 


x 


Also,  let  uN(t)  be  an  interpolating  function  of  {uNj}jL0, 

N 

nN(t)  = 

3= 0 

where  is  any  set  of  continuous  functions  (not  necessarily  polynomials)  with  the 

property  V(jV(L)  =  <%•  In  the  general  Galerkin  optimal  control  approach,  a  solution  to  the 
differential  equation  x  —  f(x,u)  =  0  may  be  approximated  at  the  LGL  nodes  with  the 
following  weak  integral  formulation  [44] 

/  (~ dt  ~  _  “*(*)))  =  °>  (4-l) 


for  i  =  0,1, ...  ,N.  In  terms  of  the  approximating  polynomials,  the  system  of  equations 
becomes 


N 


£ 


<,yd^-dtxN‘ 

1  dt 


'-i 


uN)dt  =  0, 


for  i  —  0, 1, . . . ,  N,  and  in  matrix- vector  form  is  given  by 

N 

^ DijXN 3  -Ci  =  0,  %  =  0, 1, . . . ,  N. 

3=0 
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The  (N  +  1)  x  (N  +  1)  Galerkin  differentiation  matrix,  I),  is  defined  by 

f1  M  df^ft) 

Dij  =  J  i,j  =  0,1  ,...,N,  (4.2) 

and  the  (N  +  1)  x  1  right-hand- side  (RHS)  vector,  c,  is  defined  as 

d=  j  ^(t)f(xN(t),uN(t))dt,  i  =  0, 1, . . . ,  N, 

The  Lagrange  polynomials,  0,  and  their  derivatives,  {©f  }jt0,  are  given  by  defini¬ 

tions  (2.32)  and  (2.38),  respectively.  If  LGL  quadrature  rule  is  used  with  Q  =  N  quadrature 
points,  the  differentiation  matrix,  D,  can  be  calculated  exactly  by  the  relationship 

Q  dd>N  d(bN 

Dij  =  yffifofc)— jj~{tk)wk  =  (/l  (/ -')»'/  =  i,  j  =  0, 1, . . . ,  N,  (4.3) 

fc=0 

where  the  LGL  weights,  {w{}^L0,  are  defined  by  Equation  (2.49)  and  A  is  the  Legendre  PS 
differentiation  matrix  (2.57).  The  RHS  vector,  c,  may  also  be  approximated  with  quadrature 
with  the  relationship 

Q 

Ci  ~  yy  (tk)f(xN(tk),  uN(tk))wk,  (4.4) 

k= 0 

for  i  =  0, 1, . . . ,  N.  If  again,  LGL  quadrature  rule  is  used  with  Q  =  N  quadrature  points, 
the  RHS  vector  approximation,  cN,  may  be  simplified  as 

N 

-m  =  y^(f)^(tk)f(xN(tk),uN(tk))wk  =  f(xN\uNl)wi,  i  =  0,1,...,  N.  (4.5) 

k= 0 

Remark  4.1.  Recall  that  for  LGL  quadrature  rule,  integration  is  exact  for  polynomial 
integrands  of  degree  less  than  or  equal  to  2N  —  1.  If  0  =  (N  +  1)  integration  points  are 
used,  the  RHS  vector  will  integrate  exactly  when  f(x(t),u(t))  is  linear  in  x(t)  and  u(t).  In 
the  case  of  a  nonlinear  function  f,  the  accuracy  of  integration  (and  therefore  the  accuracy 
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of  the  overall  approximation)  can  be  improved  by  increasing  the  number  of  quadrature 
points  Q.  However,  in  most  cases,  increasing  the  accuracy  of  integration  by  increasing  Q 
will  significantly  add  to  computation  time  due  to  the  required  interpolation  of  the  state  and 
control  vectors.  This  will  be  discussed  in  greater  detail  in  Section  5.5.1. 

Therefore  system  (4.1)  may  be  simplified  into  the  form 

N 

J2DijXNj  -  cNi  =  0,  i  =  0,1,..., TV.  (4.6) 

3=0 

Remark  4.2.  Inform  (4.6),  the  resulting  Galerkin  equations  that  must  be  satisfied  are 

^^AijX1*3  -  f(xN\  uNl)  j  Wi  =  0,  (4.7) 

for  i  =  i),l. ....  N.  This  implies  the  following, 

N 

J2AjXNj  -  f(xNi,um)  =  0,  (4.8) 

3=0 

for  i  —  0, 1, . . . ,  N.  Note  that  (4.8)  is  the  same  set  of  equations  that  would  be  relaxed  when 
using  the  Legendre  PS  method  (Section  3.2).  Hence,  numerical  solutions  to  system  (4.8) 
found  via  the  collocation  method  will  satisfy  the  Galerkin  relationships  in  (4.6).  However, 
inequality  versions  of  (4.7)  and  (4.8)  are  used  for  computational  purposes.  As  suggested 
by  Jackson’s  Theorem,  a  solution  of  the  inequality  version  of  (4.7)  may  not  satisfy  (4.8). 
In  fact,  the  ancdysis  for  the  Gcderkin  numerical  formulation  is  based  on  the  L2-norm.  As 
a  result,  the  inequality  bound  for  the  Gcderkin  formulation  is  not  simply  a  multiple  of  the 
quadrature  weight.  Shown  in  Equation  (4.13),  the  upper  bound  of  the  inequality  includes 
a  factor  of  y/w.  However,  this  relationship  draws  a  clear  connection  between  the  genercd 
Gcderkin  optimal  control  formulation  and  the  Legendre  PS  method,  and  will  be  exploited 
in  the  proof  of  convergence  (Theorem  4.4). 
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Remark  4.3.  Due  to  the  results  of  Gong  etal.  [5],  we  know  a  feasible  solution  to  the  equal¬ 
ity  dynamical  constraint  may  not  exist.  In  order  to  guarantee  feasibility  of  the  discretized 
problem  a  relaxation  of  this  constraint  is  used. 

The  dynamical  constraint  becomes 

N 

J2DijXNj  -  cNi 

3=0 

where  5N  is  the  feasibility  tolerance.  The  endpoint  conditions  and  path  constraints  are 
approximated  similarly  by 

\\e.(xNfxNN)\L<SN, 

h(xN\uNi)  <  8N  ■  1,  i  =  0,1,...,  N, 

where  1  denotes  [1, . . . ,  l]T .  Lastly,  the  cost  functional  J[x(-),u(-)]  is  approximated  by  the 
LGL  quadrature  rule, 

N 

J[x(-)X-)]  ~  JN(xN,uN)  =  J2F(xN\um)wi  +  E(xN0,xNN): 

i=0 

where  xN  =  xm, . . . ,  and  uN  =  [u^0,  uN1, . . . ,  uNN] .  To  allow  for  a  practi¬ 

cal  search  area  for  the  optimal  solution  the  following  constraints  are  added 

{xNi  EX,um  EU,i  =  0, 1, ,  N}, 

where  X  and  U  are  the  search  regions  that  contain  the  optimal  solution  of  the  discretized 
nonlinear  optimization. 


<SN,  i  =  0,1,...,  AT, 
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4.2.  Computation  Strategy 


The  computation  strategy  for  the  Galerkin  optimal  control  formulation  with  strong 
enforcement  of  BCs  is  presented  in  two  forms.  First,  the  strategy  for  the  continuous  prob¬ 
lem,  in  terms  of  the  approximating  polynomials  is  outlined,  denoted  as  GOCM-S.  Next, 
the  discrete  problem,  discretized  on  a  LGL  grid  is  presented,  denoted  as  GOCM-S. 

Definition  4.1.  Function  g(t )  is  called  piecewise  C1  if  3  t0  =  —1  <  t\  <■■■<  tk  =  1 
such  that  g(t )  is  C 1  on  each  subinterval  (U,  ti+ 1),  i  —  0, . . . ,  k  —  1;  lim  g(t),  lim  g(t)  and 

t->tk 

lim  g{t)  exist  for  i  —  1, . . .  ,k  —  1;  and  g(t )  is  either  left  or  right  continuous  at  each  point 

U- 

4.2.1.  Computation  Strategy  for  GOCM-S 

The  computational  strategy  of  the  GOCM-S  is  to  find  the  feasible  solution  xN(t)  G 
X  and  uN(t)  G  U  for  the  following  cases: 

Case  1.  u(-)  is  piecewise  C°  and  x(-)  G  C°  and  piecewise  C1, 

Case  2.  u(-),  x(-)  G  ifm_1  (see  Appendix  C)  and  m  >  2, 
that  minimizes 


J(xN(-),uN(-)) 


F(xn  (t),uN  (t))dt  +  E(xn(—1),xn(1)), 


(4.9) 


subject  to  the  Galerkin  constraints 


'-i 


(t)  (xN(t)  -  f(xN(t),uN(t)))  dt 


<  MwfN 


e(z"(-l),*'v(l))j|oo<MJV-”, 

|fc+(iw((),MN(«))||11<MJV-“, 


i  =  0,1, . . . ,  N, 


(4.10) 
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where  a  —  \  and  (m  —  1),  for  Case  1  and  2,  respectively;  M  is  a  constant  independent  of 
N  and 


{h,  h  >  0, 
0,  h  <  0. 


(4.11) 


4.2.2.  Computation  Strategy  for  GOCM-S 

The  computational  strategy  of  the  GOCM-S  is  to  find  the  feasible  solution  xN  G  X 
and  uN  G  U  for  the  following  cases: 

Case  1.  u(-)  is  piecewise  C°  and  x(-)  G  C°  and  piecewise  C1, 

Case  2.  u(-),  x(-)  G  H m_1  (see  Appendix  C),  m  >  2  and  x^m~l\t)  is  of  bounded 
variation  in  t  G  [—1,1]  (see  Appendix  A), 
that  minimizes 

N 

JN(ccN,  uN )  =  J2F(xNi,  uNi)wi  +  E(xno ,  xNN ),  (4.12) 

i= 0 


subject  to  the  Galerkin  constraints 


N 


^2DijXNj  -  cm  <  Mwf  N~a,  i  =  0, 1, . . . ,  N, 

1=°  oo 

lle^0,^))^  <  MN~a, 

h(xNi,  uNl)  <  MN~a  1,  i  =  0, 1, . . . ,  N, 


(4.13) 


where  a  —  \  and  (m  —  1),  for  Case  1  and  2,  respectively;  and  M  is  a  constant  independent 
of  N. 


4.3.  Feasibility  of  Solutions 

In  order  to  guarantee  feasibility  of  the  discretization.  Theorems  4.1  and  4.2  show 
that  a  relaxation  of  the  dynamical  equality  constraint  to  inequality  is  required,  for  GOCM- 
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S  and  GOCM-S,  respectively.  However,  first  a  buildup  of  lemmas  are  required  for  each 
theorem. 

Lemma  4.1.  [46]  Let  pN  (f)  be  the  N-th  order  truncated  Legendre  series  polynomial  ap¬ 
proximation  to  Q  G  Hm ,  t  G  [— 1, 1],  then 

||C(t)  -pN(t)\\L2  <  aia0A^"m,  Vf  G  [-1,1] 

and 

||C0)  -pN(t) \\Loo  <  a3a2iv^"m,  Vf  G  [-1,1], 

where  a  \  and  a 3  are  constants  independent  of  N ;  ao  =  |£|#m;iv,  the  Sobolev  seminorm  off 
(see  Appendix  C);  a2  =  f/ie  total  variation  of  f{m>  (see  Appendix  A);  and  m  >  0. 

(pN(t)with  the  smallest  norm  ||£(f)  —  pN(t)\\L2  is  called  the  N-th  order  best  polynomial 
approximation  of  (  in  the  L2-norm.) 

Lemma  4.2.  Let  (( t )  =  g(t )  +  hutc(t),  t  G  [—1, 1],  where  f,  utc  G  H°,  g  G  H1,  and 
utc(t )  =  u(t  —  tc )  zV  f/ie  unit  step  function  defined  by 

[0,  -1  <f<fc, 

[l,  fc<f<l. 

N 

Also,  let  pN(t )  =  ffpnLn  be  the  N-th  order  truncated  Legendre  series  polynomial  ap- 

n= 0 

proximation  to  Then 

||C(f)  -P^Wllia  <  +  62(f0,/z)lV_^,  Vf  G  [-1,1],  fc  7^  -1,1  and  |/i|  <  00, 

where  b\  and  lh  are  constants  independent  of  N ,  and  b0  =  j |/y  1 1  .  ( pN(t )  with  the  smallest 
norm  1 1  £  (f )  —  pN(t)  ||  2  z'.v  called  the  N-th  order  best  polynomial  approximation  of  f  in  the 
L2-norm.) 


84 


Proof.  Let 


N  N 

gN(t)  =  £>BL„  and  u£(t)  =  YunLn 

n= 0  n= 0 

be  the  truncated  Legendre  series  of  g  and  utc,  respectively.  Then 

N 

pN(t )  =  =  ^(t)  + 

n=0 

TV  TV 

^  ^QnLn  H~  h^^unLn^ 

n= 0  7T— 0 

for  t  G  [—1,1],  where 

Pn  =  gn  +  hun,  n  =  0, . . . ,  N. 

Therefore, 

N 

II CW  ~PN(t)  \\L2  =  C (t)~  Y^PnLn 

n= 0  X,2 

=  ||  {g(t)  -  gN{t))  +  h  (utc(t)  -  u£(t))  ||i2 
<\\g(t)  -  gN (t)\\L2  +  \h\\\utc(t)  -u?c(t)\\L2 

N  N 

=  g(t)  ~  YjlaLn  +\h\  Utc(t)  ~  YunLn 

n= 0  L 2  n= 0  £2 

From  Lemma  4.1, 

N 

git)  -  Y]gnLn  <  biboN-1,  t  G  [-1, 1], 

n= 0  J2 
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where  gN  is  called  the  polynomial  of  best  approximation  of  g  in  the  A2 -norm,  b\  is  a 
constant  independent  of  N  and  b0  =  1 1  /y  1 1  ^ .  Additionally,  from  [45], 


\h\ 


N 

n= 0 


<\h\ 

L2 


(4.14) 


where  the  normalizing  constants,  {'yn}'^>=0,  for  the  Legendre  polynomials  are  given  by 
Equation  (2.27)  and  the  Legendre  expansion  coefficients,  {-un}" L0,  are  defined  by  Equa¬ 
tion  (2.26).  Due  to  the  properties  of  the  Legendre  polynomials, 


Ln(t)  =  ^  x  (L'n+1(t)  -  ^_i(f))  ,  and Ln(l)  =  1, 


the  Legendre  coefficients  have  the  relationship 

Un  =  \  f  (L'n+ l(t)  -  L'n-lif))  dt=\  ( Ln-l(tc )  ~  Ln+1(tc))  , 
J  tc 

or  may  be  expressed  by 


Wn\  —  2 \(Ln-l(tc)  —  Ln+l(tc))  I  <  -  (\Ln_i{tc)\  +  \Ln+i(tc)  I)  . 


Since  the  Legendre  polynomial  has  the  bound  [72] 

4(^)3 

\Ln(t)\  <  1  T  >  t  —  1)  1) 

n  2 (1  —  tz)  4 

we  have  the  following  bound  on  un, 


b2  1 

MnF’ 
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for  n  >  2,  where 


h{tc,h )  = 


\h\ 


7T2  (1  -f2)4 


r, 


This  is  since 


112  2\/2 

- t  H - r  ^ - r  —  — r- ;  ri  >  2. 

(n  —  1)2  (71  +  1)2  (n  —  1)2  ns 


Therefore,  Equation  (4.14)  has  the  bound, 


N 


Ut, 


(f)  ^  'unLr 


n= 0 


1  1 


<62(  E  ^T)n  )  E 


L2 


>vn=Ar+l 


Vn=JV+l 


71 


However,  from  the  Integral  Test  Estimate  [75], 


Hence, 


fb  1  ^  [°  l  1 

lim  /  —da;  <  >  —  <  lim  /  —dx  = 

b^ooJN+1X-  b^ooJNX2  N 


\h\ 


N 


U* 


■  (t )  ^  ynLr 


i= 0 


<  2, 


L2 


and  finally, 


((t)—pN(t)  L2  <  bib0N  1  +  b-2N  2,  Vt  e  [—  1, 1],  to  7^  —  1, 1  and  \h\  <  00. 


□ 

Lemma  4.3  (Holder’s  Inequality).  [76]  Let  the  Holder  conjugates  p  and  q  be  real  numbers 
with  the  property  that 


1 

P 


+ 


1 

7 


1, 
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where  p  >  1  and  q  >  1.  Then  for  any  arbitrary  complex-valued  sequences  x  =  {6;-}/L0 
and  y  =  {vk}^=0  the  following  property  holds 

N  /  N  \p/N  \  q 

k= 0  \fc=0  )  \k= 0  J 

Moreover,  when  extended  to  integrals,  Holder’s  inequality  takes  the  form 

1  1 

^  \f(t)g(t)\dt  <  ^  | f(t)\pdtj  ^  I g{t)\9dtj  , 

where  f  and  g  are  assumed  to  be  p-th  and  q-th  power  summable,  respectively,  ont  G  [a,  b\. 

Lemma  4.4.  Let  be  the  Lagrange  interpolating  polynomials  of  order,  N,  de¬ 

fined  on  LGL  grid  {fi}£L0.  Then,  there  exists  a  positive  integer,  N0,  such  that,  for  any 
N>N0, 


I# 


N 

*  L2 


<  pwf  <  qN  2 , 


for  each  i  —  0, 1, . . . ,  N,  where  =0  are  the  LGL  quadrature  weights  associated  with 
the  LGL  points,  {ti}^L0,  and  p  and  q  are  positive  constants  independent  of  N. 

(  N  2  V 

Proof  From  [46],  the  discrete  norm,  ||£jv||iv  =  (  Yh  |£fc|  wk  )  •  has  the  property 

\k= o  J 


ll£jv||  L2  —  Pll£w||  N  i 


for  L,n  e  Pat,  where  p  is  a  positive  constant,  independent  of  N.  Since  <fN  G  Pat,  we  have 


KL<p|W 


N I 


liV’ 
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for  each  i  =  0, 1, . . . ,  N.  Furthermore,  from  the  property  of  the  Lagrange  polynomial, 

ffiU)  =  Sij,  we  have 

1 

w 2 

for  %  —  0, 1, . . . ,  N.  Also,  from  [46]  we  have,  for  i  =  1,  2, . . . ,  N  —  1, 


Cl 

N 


(1  -  (L)2)2  <  Wi  < 


-(1 

Ny 


(' u 


for  constants,  0  <  ci  <  c2,  independent  of  i  and  N;  for  i  =  0,  N,  we  have 


Therefore, 


w0,wN 


2 

N(N  +  iy 


for  each  i  —  0, 1, . . . ,  N.  Finally, 

||0f  ||L2  <  pwf  <  qN~ 2 

holds  for  all  N  >  N(h  where  q  is  a  constant  independent  of  i  and  N.  □ 

4.3.1.  Feasibility  of  GOCM-S 

Theorem  4.1  (Feasibility  of  GOCM-S).  Given  any  feasible  solution  1 1->-  (a:,  u),for  Prob¬ 
lem  B,  consider  the  following  two  cases: 

Case  1.  uf)  is  piecewise  C°  and  x  ( • )  e  67°  and  piecewise  C1, 

Case  2.  «(•),  x(-)  E  Hm~l  and  m  >  2. 

Then,  there  exists  a  positive  integer  N0  such  that,  for  any  N  >  N0,  GOCM-S  has  a  poly- 
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nomial  feasible  solution,  (xN(t),  uN(t))  such  that 


|| x(t) -xN(t)\\L2  <  MN~°  and  ||«(t)  -  uN(t) \\l2  <  MN~a, 

where  a  =  \  and  (m  —  1),  for  Case  1  and  2,  respectively;  and  M  is  a  positive  constant 
independent  of  N. 

Proof.  Let  pit)  be  the  (N  —  l)-th  order  truncated  Legendre  polynomial  approximation  of 
x(t).  By  Lemmas  4.1  and  4.2  there  is  a  constant  cq  independent  of  N,  for  any  N  >  N0, 
such  that 


\x(t)  -p(t)\\L2  <  CqN  ", 


where  a  =  \  and  (m  —  1),  for  Case  1  and  2,  respectively.  Define 


xN  it)  =  J  p(s)ds  +  x(— 1). 


Then  p(t)  =  xN(t)  and 


x(t)  -  xN(t)\\L2  <  2c0N  ", 


since,  from  Holder’s  inequality  (Lemma  4.3),  we  have 


x(t)  —  xN(t)  = 


(T(s)  —  p(s))  ds 


'-i 


—  /  l^(s)  —  p(s)\ds 


i-i 


<v/2  (  j  \x(s)  -  p(s)\~ds]  —  V2\\x(t)  ~p(t)\\L2  <  V2c0N 


(4.15) 


Let  uN(t)  be  the  N- th  order  Legendre  polynomial  so  that 


u{t)  -  ii  f)  L2  <  ci  N  ". 
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From  our  Galerkin  approximation,  Holder’s  inequality  (Lemma  4.3),  and  Lemma  4.4,  we 
have  for  each  k  —  0, 1, . . . ,  N, 

i 

J  <f>k  (t)  {xN(t)  -  f(xN(t),  uN(t)))  dt 

-l 

i 

<  J\<t>k  (*)  (**(*)  -  f(nN(t),  uN(t)))  | dt 

-l 

<  II <t>k  WIL2II xN(t)  -  f(xN(t),UN(t)) \\l2 
<  c2wl\\xN(t)  -  f(xN(t),UN(t)) \\l2 

<  c2wl\\x(t)  -xN(t)\\L2  +c2wl\\x(t)  -  f(xN(t),UN(t))\\L2 
=  c2wl\\x(t) -p(t)\\L2  +c2w%\\f{x{t),u{t))  -  f(xN(t),UN(t))\\L2 
=  c0c2w£N~Q  +  c2hwl  ||x(f)  -  xN(t)\\L2  +  c2l2wl  ||w(f)  -  uN(t)  ||L2 
<  c0c2wl  N~a  +  c0c2liwl  N~a  +  Cic2l2wl  N~a, 


where  {«’/,-} /Lo  are  LGL  quadrature  weights  and  U  and  l2  are  the  Lipschitz  constants  of  / 
with  respect  to  x  and  u,  respectively,  which  are  independent  of  N.  It  follows  that 


J  <t>k  (t)  {xN(t)  -  f{xN(t),uN(t )))  dt 


<  Mw\n~° 


l-i 


holds  for  each  k  —  0, 1, . . . ,  N,  and  all  N  >  N0,  where  M  is  a  constant  independent  of  N. 
For  the  endpoint  condition  we  have 


F(l) 

<V2( 


*"(1)  = 


(x(s)  —  p(s))  ds 


1-1 


<  /  \x(s)  —  p(s)|rfs 


'-1 


i(s)  -p(s)\2ds  J  =  V2\\ x(t)  -p(t)\\L2  <  V2 c0N  Q, 
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so  we  have,  by  Lipschitz  condition, 


e{xN  (— 1),  or™  (1))  <  MN~ 


For  the  path  constraint  let  V  =  {t\h(xN(t),  uN(t))  >  0}  ,V  =  [—1, 1]  \D,  since 
h(x(t),u(t ))  <  0.  Then 


\h+(xN(t),uN{t))\\L2  =  (  /  (h(xN(t),uN(t)))2dt 

'v 


<  (  I  (h(xN  {t),uN  (t))  —  h(x(t),u(t))Ydt 


< 


(h(xN (t) ,  uN (t))  —  h(x(t),u(t)))2dt  +  I  (h(xN (t) ,  uN (t))  —  h(x(t),  u(t)))2dt 
{h(xN  (t) ,  uN  (t))  —  h(x(t),u(t)))2dt 


I V  JV 

-1  \  2 


'-1 


=  \\h{xN{t),uN{t))  -  h(x(t),u(t))\\L2 
<  /3|| x(t)  -xN(t))\\L2  +  l4\\ u(t)  ~uN(t)) \\l2  <  MN~Q, 


where  /3  and  l4  are  the  Lipschitz  constants  of  h  with  respect  to  x  and  u,  respectively,  which 
are  independent  of  N.  Hence 


\\h+(xN(t),uN(t))\\L2  <MN~a. 

Thus  a  solution  (xN(t),uN(t))  to  GOCM-S  is  feasible!  □ 

4.3.2.  Feasibility  of  GOCM-S 

Theorem  4.2  (Feasibility  of  GOCM-S).  Given  any  feasible  solution  1 1->  (a;,  u),for  Prob¬ 
lem  B,  consider  the  following  two  cases: 

Case  1.  u(-  )  is  piecewise  C°  and  x  ( • )  e  C°  and  piecewise  C1, 

Case  2.  u(- ),  x(-)  G  Hm~l  and  m  >  2. 

Then,  there  exists  a  positive  integer  N0  such  that,  for  any  N  >  N0,  GOCM-S  has  a  feasible 


92 


solution,  (xN,uN)  such  that 


\\x(t)-xN(t)\\L2<MN~a, 

where  a  =  \  and  (m  —  1),  for  Case  1  and  2,  respectively;  and  M  is  a  positive  constant 
independent  of  N .  Additionally,  uN  {tf)  =  u[tf),for  i  —  0, 1, . . . ,  N. 

Proof  Let  p(t) be  the  (N  —  l)-th  order  truncated  Legendre  polynomial  approximation  of 
x(t)  in  the  L2-norm.  By  Lemma  4.1  there  is  a  constant  <7,  independent  of  N,  for  any 
N  >  Ni,  such  that 


\\x(t)  -p(t) ||Loo  <  diN  /3, 

where  /3  =  (m  —  |),  for  Case  2.  For  Case  1,  we  refer  to  [77-79]  which  show  the  truncated 
Legendre  approximation  for  discontinuous  functions  with  jump  discontinuity  (such  as  the 
step  function  defined  in  Lemma  4.2)  displays  Gibbs  phenomenon.  However,  the  maximum 
amplitude  of  the  overshoot  has  a  finite  limit;  we  conclude  that  for  Case  1,  (3  =  0.  Also,  by 
Lemma  4.2  there  is  a  constant  d2  independent  of  N,  for  any  N  >  A7,  such  that 

||®(f)  ~P(t) l|L2  <  d2N~a, 

where  a  —  \  and  (m  —  1),  for  Case  1  and  2,  respectively.  Define 

xN(t)  =  J  p(s)ds  +  x(— 1). 

Then  p(t)  =  xN(t )  and 


x(t)  —  xN(t)  L2  <  2 d2N  a , 
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since,  from  Holder’s  inequality  (Lemma  4.3),  we  have 

\x(t)  —  xN(t)\  =  J  (x(s)  —  p(s))  ds  <  J  |i(s)  —  p(s)\ds  (4.16) 

<V2  (^J  |i(s)  -p(s)\~ds^J  =  V2\\x(t)  -p(t)\\L2  <  V2d2N~a.  (4.17) 

Also,  let  uN(t)  be  an  interpolating  function  of  u(t), 

N 

uN(t)  =  YW(t)*Ni, 

3=0 

where  {;;/;jv }^0  is  any  set  of  continuous  functions  (not  necessarily  polynomials)  with  the 
property  ,  )  =  dtJ,  and  therefore 

uNi  =  u(tj). 

Since  xN (t)  is  a  A'-th  order  polynomial,  we  have 

N 

xN(tk)  =  y^AkjxNj, 

3=0 

where  A  is  the  (N  +  1)  x  (N  +  1)  Legendre  PS  differentiation  matrix  (2.57)  and 

xNk  =  xN(tk). 

Recall  that  the  LGL  quadrature  weights,  {wk}k=0,  have  the  property 

wk  <  d^N^il  -  (tk)2)K  k  =  1,2, . . . ,  N  —  1, 
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for  constant  d3  >  0  independent  of  k  and  N;  for  k  =  0.  Ar,  we  have 


w0,wN  = 


N(N  +  iy 


So,  we  have 


N 


y  kjX 


Nj  _  -Nk 


3=0 
.N, 


N 


J2A^Nj  -  f(xN\uNk) 


3=0 


Wk 


=  \xn  (4)  -  f(xNk,  uNk)\wk  =  | p(tk)  -  f(xNk,  uNk ) \wk 
<  I  p(tk)  -  f(x(tk),u(tk))\wk  +  I  f(x(tk),u(tk)  -  f(xNk,uNk)\wk 
=  I p{tk)  —  x(tk)\wk+  I f{x(tk),u(tk)  -  f(xNk,uNk)\wk 
<  | \p(t)  -  x(t)\\Loawk  +  h\x(tk)  -  xN(tk)  \wk 
<  d4WkN~P  +  V2lid2wkN~a , 


for  each  k  =  0,1,2,...,  iV,  where  l\  is  the  Lipschitz  constants  of  /  with  respect  to 
Putting  this  all  together,  we  conclude  that 


N 


- 


-Nk 


3=0 


<  dAwkN~ 0  +  V2hd2wkN-a  <  MwlN~a , 


for  each  k  —  0, 1,  2, . . . ,  N,  and  all  N  >  N:i,  where  M  is  a  constant  independent  of  N. 
For  the  endpoint  condition,  we  have 


x(tN)—XN(tN)  =  x(l)  —  xN  (1)  =  /  |±(s)  —  p(s)\ds 


N  t 


'-1 


-1 

'-1 


<V2  \x(s)  -  p(s)\2ds  )  =  V2\\x(t)  —p(t)\\L2  <  V2d2N  a. 


So,  by  Lipschitz  condition, 


e(xN (t0),xN (tN))  <MN  a. 


x. 
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For  the  path  constraint,  the  following  estimate  holds 


|  h(x(t),u(t))  —  h(xN (t) ,  uN (t))  |  <  l2\x(t)  —  xN (t)\  <  s/2l2d2N  ", 

for  each  k  =  0, 1,  2, . . . ,  N,  where  l2  is  the  Lipschitz  constants  of  h  with  respect  to  x. 
Hence, 


h(xNk,uNk)  <  h(x{tk),u(tk))  +  MN~a  •  1. 

Thus  a  solution  (xN ,  uN)  to  GOCM-S  is  feasible!  □ 

Remark  4.4.  Although,  Theorems  4.1  and  4.2  do  not  provide  exact  feasibility  tolerances  for 
the  existence  of  solutions  to  the  GOCM-S  and  GOCM-S,  we  can  be  confident  that  solutions 
do  in  fact  exist.  Precise  bounds  may  be  found  experimentally,  using  a  recursive  refinement 
process,  by  increasing  the  order  of  the  approximation,  N,  until  all  the  constraints  in  the 
NLP  are  satisfied. 

4.4.  Consistency  of  Solutions 

Theorems  4.1  and  4.2  show  that  solutions  exist  to  the  GOCM-S  and  GOCM-S,  re¬ 
spectively.  However,  the  question  still  remains — will  these  solutions  converge  to  those  that 
we  seek?  The  answer  is  yes — Theorems  4.3  and  4.4  presented  below  show  that  solutions 
to  GOCM-S  and  GOCM-S,  will  in  fact  converge  to  the  optimal  solution  of  Problem  B. 
However,  first  a  definition  and  lemma  are  required. 

4.4.1.  Consistency  of  GOCM-S 

Definition  4.2.  The  orthogonal  system  {f>k(t)}ff0  is  complete  in  L 2,  t  G  [—1, 1],  if  and 
only  if,  for  £(t)  G  L2,  the  condition  f>k(t)^(t)(t)dt  =  0,  V/c  >  0,  implies  ||£(t)||L2  =  0. 
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Lemma  4.5.  Let  eN  (t)  G  Hrn  with  m  >  \  ( see  Appendix  C).  Assume 


N  / 


1-1 


<  b0wfSN, 


for  each  i  =  0, 1, . . . ,  N,  where  LS  hap  range  interpolation  polynomial  of 

order  N  defined  on  LGL  grid,  {wj}^0  are  the  associated  LGL  quadrature  weights 

and  bo  is  a  constant  independent  of  N .  Also  assume  3  e{t)  e  Hm  with  m  >  so  that 


e  —  e 


N\ 


I L2 


<  h  5 


■N 


where  SN  <  N  a,  a  >  |  and  b\  is  a  constant  independent  of  N.  Then 


llell L2  ~  0. 

Proof.  Recall  from  Equation  (2.35)  that  the  orthogonal  Legendre  polynomials,  {Lj  (t  )  }j30, 
can  be  written  as  linear  combinations  of  Lagrange  polynomials,  {<j>? (t)}?L0,  defined  on  the 
LGL  grid,  {4}f=0,  as 

N  N 

Li(t)  =  (t)  =  (*)> 

i= 0  2=0 
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where  V  is  the  generalized  Vandermonde  matrix  (2.30).  From  Holder’s  inequality  (Lemma  4.3) 
and  Lemma  4.4,  for  each  i  —  0, 1, . . . ,  N,  we  have 


'-i 

< 


(f)f  (t)e(t)dt 

f1 

-l 


r: 


(t)  (eN(t)  -  eN(t)  +  e(f))  dt 


(e(t)  -  eN(t))  dt 


+ 


[t)eN  (t)dt 


<  /  |  <t>i(t)  (e(t)  -  eN(t))  |  dt  +  b0w?SN 

'-i 


<  IkfWll^ll e(t)  -  (t)  || L2  +  b0w?SN 


<  b2w?  5n  +  b0w 2  5JV  =  Ih/ujf  5 


—  h„r,n2KN 


From  [46],  we  have 


4(1  -  <w,<  4(1  -  (t,)2)*,  i  =  1,2, 


,7V-  1, 


for  constants  0  <  64  <  65,  independent  of  i  and  Ar;  for  i  =  0,  N,  we  have 


wo,  wat  = 


N(N  +  1)' 


Also  recall,  from  [72],  that 


\LM  <  ~ 


4(j 


j2(l  -f2)4 


r, 
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Since 


f-i  ^ (t)e(t)dt  <  b3wf5N,  for  each  i 


0, 1, . . . ,  N,  we  have 


Lj(t)e(t)dt 


1 


N  ! 

Egv.)  / 

i= 0  J~l 


< 


TV  ! 

E \L>(U)\  /  T-f 

i=0  J -1 


(t)e(t)dt 


4> i  (t)e(t)dt 

TV 


< 


i=0 


<  5- 


TV 


2v/2 


TV-1 


=  <5 


TV 


(AT(AT  +  1))2 
2\/2 


i— 1 


1 

) 

iV§ 

) 

,\'  -  1 

r  +  &6TT-r  I  <M"  (- 
(N(N  +  1))*  jiN~2  J  \J 


N\  2 


for  each  j  =  1 ,N,  and  constant,  b7,  for  all  N  >  N0.  Since  5N  <  N  “  and  a  >  |,  it 
follows  that  when  N  — >  00  we  have 


0, 


for  each  j  =  0, 1, . . . ,  N.  Since  the  Legendre  polynomials,  {L.,}°L0,  are  complete  in  L 2 
space  [80]  and  e  _L  E,  for  all  j  —  0, 1, . . . ,  N,  we  can  conclude, 


e(f)||L2  =  0. 


□ 

Theorem  4.3  (Consistency  of  GOCM-S).  Suppose  (xN (t) ,  uN (t))  is  a  solution  of  GOCM- 
S  and  there  exists  (x(t),u(t))  such  that  u(-),  x(-)  G  Hm  l  with  m  >  2.  Also,  suppose 
xN(t)  — >  x(t)  uniformly,  and 

\\x(t)-xN(t)\\L2<KSN,  (4.18) 

Ht)-uN(t)\\L2<K5N,  (4.19) 
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where  SN  <  N  ",  a  >  \  and  K  is  a  constant  independent  ofN.  Then  (x(t),u(t))  satisfies 

ll®(f)  -  f(x{t),u(t)) \\L2  =  0, 
e(x(-l),x(l))  =  0, 
h(x(t),u(t ))  <  0, 

and  is  an  optimal  solution  to  Problem  B. 

Proof.  Let  eN(t)  =  xN(t)  —  f  (xN (f) ,  uN (t))  and  e{t)  =  x(t)  —  f(x(t),u(t)).  From 
Lemma  4.5,  to  prove  ||  <t)  ||l2  =  o,  it  is  enough  to  prove 

J  <t>k  (t)e(t)dt  <  c0w£5n, 

for  each  k  —  0, 1, . . . ,  N,  where  5N  <  N~a,  a  >  Consider 

J  <i>% (t)e(t)dt 

<  J  <Pk  0)  (xN(t)  -  f(xN(t),uN(t)))  dt  +  J  </>%  (t)  (z(t)  -  xN(t))  dt 

+  J  <i>k{t)(f(xN{t),uN(t))^f(x,u))dt 

<  c2wl5N  +  wl\\x(t)  -xN(t)\\L2  +wl\\f(x(t),u(t))  -  f(xN(t),u(t))\\L2 

<  c2wl5N  +  wl\\x(t)  -xN(t)\\L2  +l1wj‘\\x(t)  -xN{t)\\L2  +  l2wl\\u{t)  -uN(t)\\L2 
<  c2w\ SN  +  Kw\ 5n  +  hKw\&N  +  hKw\ SN  <  c0wUN, 

where  l\  and  l2  are  the  Lipschitz  constants  of  /  with  respect  to  x  and  u,  respectively,  which 
are  independent  of  N.  It  follows,  from  Lemma  4.5,  that  as  N  — *  oo  we  have 

II e Hz,2  =  11^0)  -  f{x{t),u(t)) \\L2  =  0. 
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For  the  endpoint  condition,  since  xN(t)  — >  x(t)  uniformly,  we  have 

0^(1)  — y  x(l)  and  xN(— 1)  — >  x{— 1). 

Since,  from  the  formulation  of  the  computational  strategy  we  have  |e(aJV(-l),aJV(l))|  < 
MN~a,  we  conclude  that  e(x(— 1),  x(l))  =  0  as  N  — >  oo. 

For  the  path  constraint,  since  h(x(t),u(t ))  is  piecewise  C1,  if  h(x(t*),u(t*))  >  0, 
3  an  interval  (a,  b )  in  which  h(x(t),u(t ))  >  0.  Then 

\\h(x(t),u(t))\\L2{a!b]  =  (h{x(t),u(t)))2dtj  >0.  (4.20) 

However, 


\\h(x(t),u(t))\\L2{ab)  <  || h(x(t),u(t))  -  h+(xN(t),uN(t))\\L2{ab)  +  ||/r+(xAr(f),M7V(f))|| 
<  || h(x(t),u(t))  -  h(xN(t),uN(t))\\L2{ab)  + MN~a 
<  h\\x(t )  -  xN(t))\\L2  +  l4\\ u(t)  -  uN(t))\\L2  +  MN~a, 


L2{a,b) 


where  (3  and  I4  are  the  Lipschitz  constants  of  h  with  respect  to  x  and  u,  respectively, 
which  are  independent  of  N.  Hence,  this  is  a  contradiction,  therefore  h(x(t) ,  u(t))  <  0  as 
N  — >  00. 

Suppose  that  (x(t),u(t))  is  not  optimal.  Then  3  (x* (t) ,  u*  (t)) ,  so  that 


J  (x*(-),u*(-))  <  J  (x{-),  «(■))■ 


Also,  3  (x*(t),u*(t))  such  that 


x*N(t)  -  x*(t)\\L2  <  MN~a  and  || u*N(t)  -  u*(t)  \\L2  <  MN~a , 
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where  (x*N(t),u*N(t))  is  a  feasible  trajectory  of  GOCM-S.  Therefore 

(4.21) 

However, 


I J  -  J (>(-)X-))| 

<  J  I F(xN(t),uN(t))  -  F(x(t),u(t))\dt  +  \E(x»(-1),xn(1))  -  E(x(-1),  x(l))  | 
<  V2\\F(xN(t),uN(t))  -  F(x(t),u(t))dt\\L2  +  |^(a;Ar(-l),a;Ar(l))  -  E(x(- l),x(l)) 

Due  to  the  Lipschitz  condition  and  assumptions  (4.18)  and  (4.19)  we  have 


lim  J  ( xN( •) 

N- 5-00  V 


uN(-)) 


j  (x(-)X-)) 


o. 


Similarly, 


lim 

iV— >-oo 


j  (x-Ka  «•"(■))  -j  (i*(o,  «•(■)) 


0. 


Therefore,  from  (4.21)  we  have 


J  (>*(-)X(-))  >  J (z(-)XO)- 


This  is  a  contradiction,  since  we  assumed 


J  (®*(0>w*(0)  <  J  O(-)X'))- 


We  conclude  that  (x(t),u(t))  achieves  an  optimal  cost  and  therefore  is  an  optimal  solution 
to  Problem  B !  □ 
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4.4.2.  Consistency  of  GOCM-S 

Lemma  4.6  (Theorem  6.5.5,  [81]).  Let  f  be  Riemann  integrable  in  [—1, 1]  and  (t)}^=0 
be  the  Lagrange  polynomials  of  order  N  defined  on  a  LGL  grid,  {tk  }/)=0-  Define 

N 

f(t)  = 

k= 0 

where  fNk  =  fN {tk),for  k  —  0, 1, . . . ,  N.  Then 

N  i*  1  /»1 

lim  y^J(tk)wk  =  lim  /  fN(t)dt  =  /  f(t)dt, 

N— >00 '  J  N^fOO  /  -i  /  1 

k=0  J~l  J~1 

where  {wk}^=0  are  LGL  quadrature  weights  associated  with  the  LGL  points,  {tk}k=0. 

Theorem  4.4  (Consistency  of  GOCM-S).  Suppose  { (xNk,  uNk),  0  <  k  <  iV}“=JVi  is  a  se¬ 
quence  of  solutions  to  GOCM-S,  [t  H »  (xN(t),uN(t))}™=  _N  are  their  interpolating  func¬ 
tions  and  there  exists  functions  (x(t),u(t))  such  that  u(- ),  x(-)  G  Hm~l  with  m  >  2  and 
x(r"-  l,(t)  is  of  bounded  variation  in  t  G  [—1, 1]  (see  Appendix  A).  Also,  suppose 

lim  || u(t)  -  uN(t)\\LOO  =  0,  (4.22) 

JV— »oo" 

lim  || x(t)  -  xN(t)\\  =  0.  (4.23) 

At — 1 1  1  1  LJ 


Then  (x(t)}u(t))  satisfies 

( 

||£(f)  -  f(x(t),u(t)) \\Loo  =  o, 
<  e(x(-l),a;(l))  =  0, 
h(x(t) ,  u(t))  <  0, 

and  is  an  optimal  solution  to  Problem  B. 
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Proof.  This  proof  follows  the  outline  of  Theorem  2  given  by  Gong  et  al.  in  [5].  First,  from 
assumptions  (4.22)  and  (4.23)  it  is  easy  to  show 

lim  || x(t)-xN(t)\\  =  0. 

TV— »oo  MJj 

Next,  suppose  that  (x(t),u(t))  is  not  a  solution.  Then  there  is  a  time  r  e  [—1, 1]  such  that 

X(r)  -  f(x(T),u(T))  ^  0. 

From  [82],  the  LGL  nodes,  {U}f=0,  are  dense  when  N  — >  oo.  Then  there  exists  a  sequence, 
{i^},  where  0  <  iN  <  N,  and  with  property 

lim  tjN  =  r, 

iV^oo 


so  that 


x(r)  -  f(x(r),u(T ))  =  lim  (xN(tiN)  -  f  (xN (tiN) ,  uN (tiN)))  ±  0.  (4.24) 

7V-»oo 

Also,  we  have 

N 

xN(tiN)  =  ^AiNjxN\ 

3=0 

where  A  is  the  Legendre  PS  differentiation  matrix  (2.57).  Therefore,  from  Theorem  4.2, 


N 

N 

»,v.rrV'  -  cmN 

= 

,  umN) 

3=0 

3= 0 

xN(tiN)  -  f(xN(tiN),uN(tiN))\wiN  =  MwiNN 2  m, 
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where  w{n,  is  the  LGL  quadrature  weight  associated  with  LGL  point,  t(,v,  for  each  iN .  This 
implies 


lim  (xN(tiN)  —  f(xN(tiN),uN(tiN)))=  lim  MN*  m  =  0, 

jV-»  oo  v  '  N^roo 

which  contradicts  Equation  (4.24).  We  conclude  that  (x(t),u(t))  is  a  solution. 

For  the  path  constraint,  we  consider  the  same  contradiction  argument  given  above. 
For  the  endpoint  condition,  since  xN(t)  — >  x(t)  uniformly,  we  have 

0^(1)  — »  x(l)  and  xN(— 1)  — >  x(— 1). 

Since,  from  Theorem  4.2,  we  have 

|e(aJV(-l),aJV(l))|  <  MN 1-m, 


we  conclude  that 


e(x(— 1), x(l))  =  lim  e(xw(— 1), ^(1))  =  lim  e(xN0,xNN)  =  0. 

N— »oo  N— »oo 


For  the  cost  functional  we  have 


N 


NO  - NN\ 


JN(ccN,uN)  =  J2F(xNk,uNk)wk  +  E(xN",x 

k= 0 


and 


J(x(-),u(-))  =  /  F(x(t),u(t))dt  +  E(x(—l),x(l)). 


i-i 
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By  Lemma  4.6,  we  have 


N 


1-1 


F(x(t),u(t))dt  =  lim  F(x(tk),u(tk))wk, 

N^rooz — J 


k= 0 


and  therefore 


F(x(t),u(t))dt 


i-i 


N 


N 


-Nk  -Nk 


^inio  |  y  F(xNk,uNk)wk  +  [F(x(tk),u(tk)  -  F(xNk , 

k= 0  k= 0 


However, 


N 

lim  [F(x(4),m(4))  -  F(xNk,uNk)]  wk 

W— >■  ooz — ' 
k=0 

N 

<  lim  V'|F(x(4),  w(ffe))  -  F(xJV(ffe),MJV(4))|wfc 
k=0 

N  N 

<  h  lim  V'|x(4)  -  xN(tk) \wk  +  l2  lim  Y^|m(4)  -  MJV(ffe)|wfe  =  0, 

^  1  1  N—±rv-X - 1 


fc=0 


fc=0 


where  /i  and  l2  are  the  Lipschitz  constants  of  F  with  respect  to  x  and  u.  Thus  we  conclude, 


c 


iV 


F(x(t),u(t))dt  =  lim  'S^F(xNk,uNk)wk. 

TV— ^oo-^ — ' 


k=0 


Finally,  by  Lipschitz  condition, 


lim  F(V  , or  )  =  E(x(— 1), x(l)), 

N-x  oo 


and  the  limit 


lim  /(/,/)  =  J  (x(0,u(0) 

7V-»oo 


(4.25) 


follows. 


Wk 
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Let  { (. x*Nk ,  u*Nk),  0  <  k  <  N}°^=n^  be  an  optimal  sequence  of  solutions  to  GOCM- 
S  and  {t  i — y  (x*N(t),  u*N(t))  }“=  be  their  interpolating  functions.  From  Theorem  4.2, 

u*N(t)  ||LOO  =  0,  (4.26) 

x*N(^\\l°°  =  0’  (4-27) 


lim  \\u*(t)  — 

N—too 

lim  I  la:*  (t)  — 

AT-s-oo11 


and  from  (4.25)  we  have 

J  (**(■),  «*(■))  <  J  (*(•),«(•))  =  lim  JN(xN(-),uN(-))  <  lim  j^V"(-),  «*"(■))■ 

N^oo  N— >oo 

Finally,  from  conditions  (4.26)  and  (4.27)  we  conclude 

J  (x*(-),  «*(■))  =  J  (®(0>  «(•))• 

Hence  (x(t),u(t))  achieves  an  optimal  cost  and  therefore  is  an  optimal  solution  to  Prob¬ 
lem  B !  □ 

Remark  4.5.  Theorems  4.3  and  4.4  provide  confidence  that  solutions  not  only  existence  to 
the  GOCM-S  and  GOCM-S,  but  solutions  will  converge  to  the  optimal  solution.  However, 
questions  still  remain  about  the  conditions  under  which  assumptions  (4.18),  (4.19),  (4.22) 
and  (4.23)  exist  ( as  pointed  out  by  Gong  el  al.  [5]).  Answers  for  similar  questions  have  been 
provided  for  the  Legendre  PS  method  by  Kang  [6],  but  like  analysis  for  Galerkin  optimal 
control  is  above  the  scope  of  this  dissertation. 
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CHAPTER  5: 

ALTERNATIVE  FORMS  OF  GALERKIN  OPTIMAL  CONTROL 


An  advantage  in  using  Galerkin  optimal  control  is  that  it  is  a  versatile  family  of 
formulations.  There  are  a  number  of  Galerkin  forms  that  can  be  used  to  suit  the  problem  at 
hand.  In  Chapter  4,  the  general  formulation,  Galerkin  optimal  control  with  strong  enforce¬ 
ment  of  end  conditions,  was  presented.  This  serves  as  the  first  of  three  global  formulations 
that  are  outlined  in  this  dissertation.  The  second  global  formulation  is  Galerkin  optimal 
control  with  weak  enforcement  of  boundary  conditions,  and  will  be  discussed  in  Section 
5.1.  (Additionally,  a  third  global  Galerkin  optimal  control  formulation  with  Legendre  test 
functions  will  be  presented  in  Chapter  6.)  Important  results  in  this  chapter  include  Theo¬ 
rems  5.1  and  5.2,  which  prove  that  nonlinear  program  Problems  GOCM-W  and  GOCM-W 
(outlined  in  Section  5.1.2)  have  feasible  solutions  to  Problem  B,  where  the  controls  may  be 
piecewise  continuous. 

Next,  the  element  based  formulations  will  be  presented  and  are  divided  into  two 
forms:  Galerkin  optimal  control  with  element-based  continuous  and  discontinuous  Galerkin 
techniques,  which  will  be  presented  in  Sections  5.2  and  5.3,  respectively.  As  alluded  to  in 
Chapter  3,  the  Galerkin  weak  integral  form  improves  feasibility  of  the  multi-scale  approach 
highlighted.  The  method  of  approximation  for  the  multi-scale  Galerkin  optimal  control  for¬ 
mulation  will  be  outlined  in  Section  5.4.  Finally,  Section  5.5  will  discuss  modifications  to 
the  Galerkin  optimal  control  formulations,  such  as  over-integration  of  the  RHS  vector  and 
the  use  of  quadrature  points  other  than  LGL,  such  as  LG  and  LGR  points. 

5.1.  Galerkin  optimal  control  with  Weak  Boundary  Condition  Enforce¬ 
ment 

The  general  Galerkin  optimal  control  strategy  presented  in  Section  4.2  describes  a 
formulation  in  which  boundary  conditions  are  enforced  in  a  strong  sense,  via  a  constraint 
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of  the  form 


\\e(xN0,xNN)\L<5N- 

Recall,  this  boundary  enforcement  method  is  also  incorporated  into  the  Legendre  PS  method 
(see  Section  3.2).  This  section  presents  an  alternative  formulation  of  Galerkin  optimal  con¬ 
trol  introduced  in  [73,  83],  one  that  allows  for  enforcement  of  the  problem  end  conditions 
in  a  weak  sense  through  the  dynamical  constraint. 

5.1.1.  Method  for  Approximation 

We  now  consider  the  Galerkin  optimal  control  formulation  with  weak  enforcement 
of  boundary  conditions.  In  this  approximation  to  Problem  B,  the  state  trajectory,  x(t), 
is  approximated  with  globally  interpolating  iV-th  order  Lagrange  polynomials, {(j)^}f=0, 
defined  on  a  grid  of  LGL  nodes,  {tj}f=0, 

N 

x(t )  «  XN(t)  =  (t)xNj . 

3=0 

Due  to  the  property  of  the  Lagrange  polynomials,  =  5ij,  we  have 

xNj  =  xN(tj ),  .7  =  0,1,...,#. 

Also,  let  uN(t)  be  an  interpolating  function  of  {uNj}jL0, 

N 

uN(t)  =  Y^(t)aNj, 

3=0 

where  is  any  set  of  continuous  functions  (not  necessarily  polynomials)  with  the 

property  ijjf  {U)  =  dl3.  As  done  previously,  taking  the  weak  integral  form  of  x—f{x,  u)  =  0 
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yields  [44] 


<!>?(*)  (  -  f(xN(t),uN(t ))  )  dt  =  0, 


i-i 


dt 


for  i  =  0, 1, . . . ,  N.  Integration  by  parts  on  the  first  term  results  in  Galerkin  weak  form, 


-1  rU,N 


d(j). 


!_  i  dt 


1  xNdt  +  \(f>^  xN~\1_1  —  I  f(xN  ,uN)dt  =  0. 


In  terms  of  our  approximating  polynomials  and  the  true  boundary  conditions  (letting  xN(— 1)  — > 
x(— 1)  and  xJV(l)  — y  x(l))  we  have 


(j)i  f(x  ,u  )dt  —  0, 


>-i 


for  i  =  0, 1, . . . ,  N.  Integration  by  parts,  yet  again,  results  in  Galerkin  strong  form  with 
weak  enforcement  of  BCs, 


N  „i  71  TV 
3=0  J~1 

/  N  > 

-4>i  (!)  ( 'Yjfi _  ^C1) 


N  \ 

^2$? (~l)xNj  -  x(-l)  I 
3=0  J 

~  J  (f>?f(xN,uN)dt  =  0. 


The  expression  may  be  simplified  as 

N 

J2Dij*Nj  +  ~  CNl  =  0; 

3=0 


for  each  i  —  0, 1, . . . ,  N,  where  the  Galerkin  differentiation  matrix,  I).  and  the  RHS  vector 
approximation,  c,  are  unchanged  from  those  given  in  the  GOCM-S  methodology — given 
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by  Equations  (4.3)  and  (4.5),  respectively — and 


xN0  —  x{— 1), 

i  =  0, 

x(l)  —  XNN  i 

i  =  N, 

o, 

i^0  ,N. 

The  BC  term  k  now  provides  a  natural  way  to  introduce  end  conditions  into  our  numer¬ 
ical  scheme.  BCs  such  as  e(x(— 1),  x(l))  =  [#(— 1)  —  x°,  x(l)  —  x?~\ 1  =  [0,0]Tcanbe 
imposed  by  defining  k  as 


xN0  —  x° 

i  —  0, 

xf  _  XNN 

i  =  N, 

o, 

for  i  —  0, 1, . . . ,  N. 


Remark  5.1.  A  similar  technique  is  discussed  by  Ross  et  al.  in  [11,  84,  85].  The  ability 
to  weakly  enforce  boundary  conditions  is  not  limited  to  the  Galerkin  formulation.  The 
collocation  method  ( in  the  context  of  the  Legendre  PS  method )  may  be  formulated  for  weak 
enforcement  of  end  conditions  by  modifying  the  discrete  differential  Equation  3.5  with  a 
boundary  condition  term  k 


N 


^ AijXN]  +Ki-  ffxN\uNl)  =0,  i  =  0, 1, . . . , N, 
j=o 


(5.1) 


where  k  is  given  by 


Hi  =  < 


—  /  XJ  —X 


f_^NN 


0 
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i  —  0, 
i  =  N, 
i  fO,N, 


for  fixed  end  conditions  e(x(—l),x(l))  =  [x(— 1)  —  x°,x(l)  —  x^1  =  [0,0]T. 

Remark  5.2.  For  existing  direct  methods  for  optimal  control  it  is  common  to  enforce  the 
problem’s  BCs  in  a  strong  sense  (or  exactly)  by  making  them  a  set  of  constraints  enforced 
by  the  nonlinear  program  (NLP)  [5].  With  the  Galerkin  optimal  control  formulation  with 
weak  boundary  enforcement,  BCs  can  now  be  enforced  in  a  weak  sense.  In  other  words, 
BCs  can  be  imposed  only  up  to  the  order  of  accuracy  of  the  numerical  approximation  itself, 
which  is  sufficient  for  many  applications.  In  the  case  of  a  problem  with  an  incomplete 
set  of  end  conditions,  such  as  an  initial  value  problem  with  condition  e(x(-l),a;(l))  = 
x(— 1)  —  x°  =  0,  k  may  be  defined  as 

( 

xN0  —  x°,  i  =  0, 

Ki  =  <  0,  i  =  N, 

0,  i  f  0,  N. 

Lastly,  for  more  complex  BCs  such  as  periodic  conditions  e(x(- l),a;(l))  =  x(-l)  - 
x(l)  =  0,  or  other  complicated  BCs  such  as  nonlinear  functions  of  x{—  1)  and  x(l),  hi 
may  be  defined  as,  n,  =  0,  for  i  =  0, 1, . . . ,  N,  and  the  condition  e(x(-l),x(l))  =  0 
may  be  enforced  as  a  set  of  constraints  by  the  NLP.  However,  this  last  case  will  result  in 
strong  enforcement  of  the  BCs  and  some  of  the  advantages  of  using  the  weak  boundary 
formulation  may  be  lost. 

The  dynamical  constraint  becomes 

N 

^ DijxNj  +  Hi  —  cNi  <6n,  i  =  0,1,...,  N. 

i=°  oc 

The  path  constraints  are  approximated  by 

h(xN\uNi)  <  5N  ■  1,  i  = 
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Lastly,  the  cost  functional  J[x(-),u(-)]  is  approximated  by  the  LGL  quadrature  rule, 


N 

J[x(-),<)]  «  JN(xN,uN)  =  J2F(xNi,um)wk  +  E(xno,xnn), 

i= 0 

where  xN  =  xN1, . . . ,  xNN]  and  uN  =  [u^0,  uN1 , . . . ,  uNN] .  To  allow  for  a  practi¬ 

cal  search  area  for  the  optimal  solution  the  following  constraints  are  added 

{xm  e  x,  um  e  U,  i  =  o,  l, . . . ,  N}, 

where  X  and  U  are  the  search  regions  that  contain  the  optimal  solution  of  the  discretized 
nonlinear  optimization. 

5.1.2.  Computation  Strategy 

In  order  to  guarantee  feasibility  of  the  discretization.  Theorems  5.1  and  5.2  show 
that  a  relaxation  of  the  dynamical  equality  constraint  to  inequality  is  required,  for  GOCM- 
W  and  GOCM-W,  respectively. 

5.I.2.I.  Computation  Strategy  for  GOCM-W 

The  computational  strategy  of  the  GOCM-W  is  to  find  the  feasible  solution  xN(t)  G 
X  and  uN(t )  G  U  for  the  following  cases: 

Case  1.  u(-)  is  piecewise  C°  and  x(-)  G  C°  and  piecewise  C1, 

Case  2.  u(-),  x(-)  G  Hm~ 1  and  m  >  2, 
that  minimizes 

J(xN(-),uN(-))  =  I  F(xN{t),uN{t))dt  +  E(xN(- l),xN(l)),  (5.2) 
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subject  to  the  Galerkin  constraints 


r: 


<f>i  (t)  {xN(t)  -  f(xN(t),uN(t)))  dt  +  Ki 


<  MN~ 


*  =  0,1,...,#, 


(5.3) 


\h+(xN(t),uN(t))\\L2  <  MN~a, 


where  a  —  1  and  (m  —  1),  for  Case  1  and  2,  respectively;  M  is  a  constant  independent  of 
N- 


h+ 


{h,  h  >  0, 

0,  h  <  0; 


and 


xN(— 1)  —  x°, 
xf  _  xJV(l), 


*  =  0, 
i  =  N, 


0,  i^0,N, 


where  e(x(—l),  a:(l))  =  x(— 1)  —  x°,  x(l)  —  xf  T  =  [0,0]T. 


(5.4) 


5.I.2.2.  Computation  Strategy  for  GOCM-W 

The  computational  strategy  of  the  GOCM-W  is  to  find  the  feasible  solution  xN  ex 
and  uN  E  U  for  the  following  cases: 

Case  1.  u(-)  is  piecewise  C°  and  x(-)  E  C°  and  piecewise  C1, 

Case  2.  u(-),  x(-)  E  Hm~ 1 ,  m  >  2  and  x(m-1)(t)is  of  bounded  variation  in  t  E 

[-1,1]. 

that  minimizes 


N 


JNlxN,iiN) 


xN\uNl)wi  +  E(x 


N 0  ~NN 


(5.5) 


i= 0 


subject  to  the  Galerkin  constraints 

II  N 


DijXN:i  +  Ki  —  cNt 

3= 0 


<  MN~a,  i 

OO 


h(xNi,uNi)  <  MN~a  •  1, 


=  0,1,...,#, 

*  =  0, 1, . . . ,  IV, 


(5.6) 


115 


where  ol  —  \  and  (m  —  1),  for  Case  1  and  2,  respectively;  M  is  a  constant  independent  of 
N- 

( 

xN0  —  x°,  i  =  0, 

=  <  x?  —  xNN,  i  =  N,  (5.7) 

0,  i  f  0,  TV, 

where  e(x(— 1),  x(l))  =  [x(— 1)  —  x°,  x(l)  —  x?]  T  =  [0, 0]T. 

5.1.3.  Feasibility  of  Solutions 

In  order  to  guarantee  feasibility  of  the  discretization.  Theorems  5.1  and  5.2  intro¬ 
duced  in  [83]  show  that  a  relaxation  of  the  dynamical  equality  constraint  to  inequality  is 
required,  for  GOCM-W  and  GOCM-W,  respectively. 

5.1 3.1.  Feasibility  of  GOCM-W 

Theorem  5.1  (Feasibility  of  GOCM-W).  Given  any  feasible  solution  1 i— >•  (x,  u),  for  Prob¬ 
lem  B,  consider  the  following  two  cases: 

Case  1.  u  ( • )  is  piecewise  C°  and  x(-)  G  C°  and  piecewise  C1, 

Case  2.  u(-),  x(-)  £  ff"1-1  and  m  >  2. 

Then,  there  exists  a  positive  integer  N0  such  that,  for  any  N  >  N0,  GOCM-W  has  a 
polynomial  feasible  solution,  (xN  (t) ,  uN  (t))  such  that 

|| x(t) -xN(t)\\L2  <  MN~a  and  ||u(t)  -  uN(t) \\l2  <  MN~a , 

where  a  —  \  and  {m  —  1),  for  Case  1  and  2,  respectively;  and  M  is  a  positive  constant 
independent  of  N. 

Proof  Let  p{t)  be  the  (N  —  l)-th  order  truncated  Legendre  polynomial  approximation  of 
x(t).  By  Lemmas  4.1  and  4.2  there  is  a  constant  c0  independent  of  N,  for  any  N  >  Nu 
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such  that 


\x(t)  -p(t)  \\l2  <  c0N  ", 


where  a  =  \  and  (m  —  1),  for  Case  1  and  2,  respectively.  Define 


xN(t)—  /  p(s)ds  +  x{— 1). 


Thenp(f)  =  xN(t)  and 


x(t)  -  xN(t)\\L2  <  2 c0N  ", 


since,  from  Holder’s  inequality  (Lemma  4.3),  we  have 


x(t)  —  xN(t)  —\  (x(s)  —  p(s))  ds  <  /  |i(s)  —  p(s)\ds 


<V2  (  /  |i(s)  -  p(s)\2ds  )  =  V2\\x(t)  -  p{t)\\L2  <  V2c0N 


Let  uN(t)  be  the  Ar-th  order  Legendre  polynomial  so  that 


u(t)-uN(t)  2  <  CiiV" 


Recall  that  the  LGL  quadrature  weights,  have  the  property  [46], 


wk<c2N  1(l-(4)2)2,  k  =  1,2, . . . ,  N  —  1, 


for  constant  c2  >  0  independent  of  k  and  iV;  for  k  =  0,  N, 


W  o,WN  = 


N(N  +  1)' 


117 


From  Theorem  4. 1  we  have,  for  each  i  —  1, . . . ,  N  —  1, 

J  </>?(!)  {xN(t)  -  f(xN(t),uN(t)))  dt  +  Ki 

=  I  ^(t)  (xN(t)  -  f(xN(t),uN(t )))  dt  <  cswfN~a, 

for  all  N  >  N2,  where  c3  is  a  constant  independent  of  N.  For  i  =  0,  we  have 

J  00  (*)  (xN{t)  -  f{xN{t),uN{t)))  dt  +  K0 

<  J  (iN(t)  -  f(xN(t),uN(t)))dt  +  ^(-l)  -x(-l)| 

<  J  <Po(t)(xN(t)-f(xN(t),uN{t)))dt<c3w^N-a, 
since  |xAr(— 1)  —  x(— 1)  |  =  0.  For  i  =  N,  we  have 

J  <j>%(t)  (; XN(t )  ~  f(xN(t),  UN(t)))  dt  +  KN 
<  J  (p%{t)  {xN(t)  -  f(xN(t),uN(t)))dt  +\xN(l) -x(l)\ 

<  czwlN~a  +  V2c0N~a, 

since 

l^-l)  —  ar(— 1)|  <  V2\\x(t)  -p(f)||L2  <  V2c0N~a. 

Finally,  for  each  i  —  0, 1, . . . ,  N, 

J  <Pk(t)  (xN(t)  ~  f(xN(t)i  uN(t)))  dt  +  Ki  <MN~a, 

for  all  N  >  Nq,  where  M  is  a  constant  independent  of  N. 

The  estimates  for  the  path  constraint  follow  from  the  proof  of  Theorem  4. 1 . 
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Thus  a  solution  (xN (t) ,  uN (t))  to  GOCM-W  is  feasible! 


□ 


5.I.3.2.  Feasibility  of  GOCM-W 

Theorem  5.2  (Feasibility  of  GOCM-W).  Given  any  feasible  solution  t  H >  (x.  u),  for  Prob¬ 
lem  B,  consider  the  following  two  cases: 

Case  1.  u(-)  is  piecewise  C°  and  x(-)  £  C°  and  piecewise  C1, 

Case  2.  uf),  x(-)  £  H m~1  and  m  >  2. 

Then,  there  exists  a  positive  integer  N0  such  that,  for  any  N  >  N0,  GOCM-W  has  a  feasible 
solution,  (xN,uN)  such  that 


II x(t)-xN{t)\\L2  <  MN~a , 

where  a  —  \  and  (m  —  1),  for  Case  1  and  2,  respectively;  and  M  is  a  positive  constant 
independent  of  N .  Additionally,  uN {ti )  =  u{U),  for  i  —  0, 1, . . . ,  N. 

Proof.  Let  p(t) be  the  (N  —  l)-th  order  truncated  Legendre  polynomial  approximation  of 
x(t)  in  the  L2-norm.  By  Lemma  4.1  there  is  a  constant  d0  independent  of  N,  for  any 
N  >  Ni,  such  that 


||T(t)  -p(t)\\Loo  <  d0N  /3, 

where  /3  —  (m  —  |),  for  Case  2.  For  Case  1  we  refer  to  [77-79]  which  show  the  truncated 
Legendre  approximation  for  discontinuous  functions  with  jump  discontinuity  (such  as  the 
step  function  defined  in  Lemma  4.2)  displays  Gibbs  phenomenon.  However  the  maximum 
amplitude  of  the  overshoot  has  a  finite  limit;  we  conclude  that  for  Case  1,^  =  0.  Also,  by 
Lemma  4.2  there  is  a  constant  d\  independent  of  N,  for  any  N  >  A7,  such  that 

||®(*)  ~p(t) ||L2  <  d!N~a, 
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where  a  =  \  and  (m  —  1),  for  Case  1  and  2,  respectively.  Define 


x  (t)  =  I  p(s)ds  +  x(— 1). 


Then  p(t)  =  xN(t)  and 


x(t)  —  xN(t )  L2  <  2 d\N  Q, 


since,  from  Holder’s  inequality  (Lemma  4.3), 


x(t)  —  xN  (t)  = 


(T(s)  —  p(s))  ds 


i 


<  /  |x(s)  —  p(s)\ds 


i- i 


<\/2  (  /  |x(s) -p(s)|Jds  )  =  V2\\x(t)  —  p(f)||L2  <  V2diN~ 


Also,  let  uN(t )  be  an  interpolating  function  of  w(t), 


N 


uN(t)  =  J2^(t)uNi, 

where  {^jVI^Lq  is  any  set  of  continuous  functions  with  the  property 
therefore 

uN>  =  u(tj). 

Since  xN(t)  is  a  N- th  order  polynomial,  we  have 


N 


X 


iN(U)  =  X/ A 

i=0 


<5jj,  and 
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where  A  is  the  (N  +  1)  x  (N  +  1)  Legendre  PS  differentiation  matrix  (2.57)  and 


xm  =  xN(U). 

Recall  that  the  LGL  quadrature  weights,  {wi}^=0,  have  the  property  [46], 

w.i  <  d2N~1(  1  -  (L)2)L  i  —  1,2, . . . , N  —  1, 


N 

y^jD0jxNj  +  k0  -  cno 

3=0 
N 

<  J2D0jXNj  -  CN0  +  ^(-l)  -  x(-l)| 
3=0 

<  d3w0iV_/3, 
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since  |a;Ar(— 1)  —  x(— 1)|  =  0.  Fori  =  N, 

N 

^ DNjXNj  +  Kn  —  CNN 
3=0 
N 

<  DNjxNj  -  cNN  +  |x(l)  -  ^(1)1 

3=0 

=  dzWNN~P  +  V2d2N~a. 


Finally,  for  each  i  —  0, 1, . . . ,  N, 

N 

J2DijXNj  +  m-  cNi  <  MN~a, 

3=0 

for  all  N  >  N0,  where  M  is  a  constant  independent  of  N. 

The  estimates  for  the  path  constraint  follow  from  the  proof  of  Theorem  4.2. 

Thus  a  solution  (xN,  uN )  to  GOCM-W  is  feasible!  □ 

5.2.  Galerkin  Optimal  Control  with  Continuous  Element-based  Galerkin 


We  now  consider  a  continuous  element-based  Galerkin  approach. 

5.2.1.  Method  for  Approximation 

In  this  approximation  to  Problem  B,  the  weak  integral  form  of  x  —  f(x,  u)  =  0  in 
each  element,  Qe,  takes  the  form  [44] 


dx^N(t) 

dt 


f(x{e)N(t),u^N{t)))  dt 


0, 


where  VL  =  IJ^i  -  -f:  defines  the  total  domain.  The  state  trajectory,  x(t),  is  approximated 
inside  each  element,  Qe,  by  interpolating  Ar-th  order  Lagrange  polynomials,  ^ 
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at  the  nodes  {t^}f=0  by  the  relationship 


N 

3=0 

for  e  =  1,  2, ,  Ne,  where  {t^}jLQ  are  the  LGL  nodes,  {£j}f=0,  mapped  back  to  the 
physical  space  inside  each  element,  Vte.  Also,  let  uN(t)  be  an  interpolating  function  of 


N 

u&N(t)  =  '%y>$e)N  (t)u&Nj , 

3=0 


where  {'tpf'1  (t)}jL0  are  any  set  of  continuous  functions  with  the  property 


Therefore  x^Ni  =  x^N(tf  ),  for  e  =  1,  2, . . . ,  Ne  and  j  =  0,1 , ,N,  and  similarly, 
u^Nj  =  v{e)N (1('i  )  ).  The  relationship  between  the  physical  time  domain,  t  G  [f0,  tj]  = 
,  and  the  computational  space,  £  G  [—1, 1],  is  given  by  [44] 


M)  A**) 

t0  )  bN 


2 

A  f(e) 


1  and  d £ 


2 

A 


dt, 


and  conversely, 

At ^  ( \  A 

t  =  ~y~  (f  +  1)  +  t o  and  dt  = 

where  At(e)  =  t$  —t^  is  the  size  of  each  element,  Q,e,  which  can  be  nonuniform  in  length. 
The  Lagrange  polynomial  defined  on  the  LGL  computational  domain  is  given  by 

0f«)  =  *  =  0, ... ,7V. 

j=0^i  s/i 
14* 
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The  state  trajectory,  x,  can  now  be  approximated  inside  each  element,  Qe,  by 


N 

x^(()  = 

3=0 

where  {<Af(0}yLo  are  the  Lagrange  polynomials  defined  on  the  LGL  grid.  Likewise,  uN(£) 
is  given  by 

N 

u«>K(o  = 

3=0 

where  V'f  (6)  = 

Remark  5.3.  In  this  formulation  x^NN  =  x(e+0No  and  u^NN  =  u^e+1^N0,  far  e  = 
1,2. ....  Ne  —  1.  This  continuity  condition  is  a  consequence  of  the  global  formulation  of 
the  problem  discussed  in  Remark  5.4. 


In  the  computational  domain,  £,  the  system  becomes 

/!  ^(Odxl‘^(0JZ  -  222  =  o. 

for  e  =  1,  2, . . , , Ne  and  i  =  0,1, ...  ,N,  and  in  terms  of  the  approximating  polynomials 
becomes 


N 

E 

3=0  ' 


kN 


1 


dfLd5  x{e)Nj  -  At(C) 
df^X  2 


<j>?f{xUNMe)N)dS  =  0. 


'-i 


In  matrix-vector  notation,  our  system  can  be  expressed  as 


N 

Y,df*(em  -  4e) 

3=0 


=  0,  i  =  0,1,...,  TV, 


for  e  =  1,  2, . . . ,  Ne  and  i  —  0, 1, ,  N,  where  the  local  element  (N+ 1)  x  (iV+1)  Galerkin 

differentiation  matrix,  I)ief  is  the  same  as  that  defined  in  Equation  (4.3).  If  Q  =  N  LGL 
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quadrature  nodes  are  used,  the  approximation  to  the  (N  +  1)  x  1  RHS  vector  simplifies  to 


d(e)Ni 


A 


f(x^Ni,U^Ni)Wi 


0,1, 


,  N, 


for  e  =  1,2,...,  iVe  and  i  =  0, 1, . . . ,  N,  where  the  size  of  each  element,  AtM\  can  be 
nonuniform  in  length. 

Remark  5.4.  So  far  the  required  objects  have  been  identified  to  solve  the  system  numer¬ 
ically  with  element-based  Galerkin.  However,  since  nodal  basis  functions  are  continuous 
across  element  boundaries  and  LGL  nodes  include  both  endpoints,  a  globed  solution  to  our 
problem  can  be  found.  To  do  this,  a  globed  assembly  or  direct  stiffness  summation  can  be 
done,  where  the  direct  stiffness  summation  operator  is  f\^lv  [44] 

The  global  equations  to  the  problem  become 

Np 

J^DuxN*J  -  c*’7  =  0,  I  =  1, . . . ,  Np. 
j= i 

The  global  Galerkin  differentiation  matrix,  Du  and  RHS  vector,  cNpI  are  then  defined  by 

Ne  Ne 

Du  =  /\D\f,  and  cN*T  =  /\c^Ni, 

e=l  e=l 


where  Np  =  ( NeN  +  1)  is  the  total  number  of  grid  points.  Note  that  the  direct  stiffness 
summation  operator  does  the  mapping  ( i,e ),  (j,e)  — >  I,  J  [44].  See  Section  2.3.2. 1  for 
additional  details.  The  dynamical  constraint  becomes 


Nv 


Y.Dux^-c* 


j=  i 


<S,  1  =  1,2, 


,Nn 
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The  endpoint  conditions  and  path  constraints  are  approximated  by 


c(x^p^  xNpNp 


oo 

h(xNpI,uNpI)  <  5- 1,  I  =  l,2,...,Np. 


<  6 


Lastly,  the  cost  functional  J[x(-),  «(•)]  is  approximated  by  the  LGL  quadrature  rule, 


J[x(-),u(-)}  mJN(xNp,uNp) 

Ne  A/(e)  N 

-E^En 


e=l 


i= 0 


where  xNp  =  [xNpl,  xNp2, . . . ,  xNpNp]  and  uNp  =  [uNpl ,  uNp2 , . . . ,  uNpNp] .  To  allow  for  a 
practical  search  area  for  the  optimal  solution  the  following  constraints  are  included:  xNp  £ 
X  and  uNp  £  U,  where  X  and  U  are  the  search  regions  that  contain  the  optimal  solution 
of  the  discretized  nonlinear  optimization. 


5.2.2.  Computation  Strategy 

The  computational  strategy  of  the  GOCM-CG  is  to  find  the  feasible  solution  xNp  £ 
X  and  uNp  £  U  that  minimizes 

JN(xNp,UNp)  —'Y^——'^2F(^Np^e~1^N+1+i\^'Np^e~1'>N+1+i'>)Wi 

e=l  i=0 

+E(xNp0,xNpNp), 


subject  to  the  Galerkin  constraints 


nv 


YPijx 

3=0 


NPJ  _  -NI 


\e{x p ,  x 

I  '  7  /  1 1 OO 

h(xNpI,uNpI) 


<SN ,  /  =  1,2,...,XP, 

<^•1,  I  =  1)  2, ,  Np, 
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where  5N  is  the  feasibility  tolerance,  which  is  dependent  upon  N. 


5.3.  GOCM  with  Discontinuous  Element-based  Galerkin 

We  now  consider  a  discontinuous  element-based  Galerkin  approach  introduced  in 

[74], 


5.3.1.  Method  for  Approximation 

In  this  approximation  to  Problem  B,  the  weak  integral  form  of  x  —  f(x,  u)  =  0  in 
each  element,  Qe,  yields  [44] 

ja  4>N(t)  (^-777  _  f(xM*(t),uMN(t)^  a  =  o, 

where  VL  =  IJ^i  defines  the  total  domain.  The  state  trajectory,  x(t),  is  approximated 
inside  each  element,  Qe,  by  interpolating  Ar-th  order  Lagrange  polynomials,  {4>^N {t)}jL0, 
at  the  nodes  {t^}jL0  by  the  relationship 

N 

x(e)w(f)  = 

3=0 

for  e  =  1,  2, ,  Ne,  where  0  are  the  LGL  nodes,  {£j}f=0,  mapped  back  to  the 

physical  space  inside  each  element,  f2e.  Also,  let  uN(t)  be  an  interpolating  function  of 

N 

u^N{t)  =  (t)u{e)Nj , 

3=0 

where  {ipje^N (t)}jL0  are  any  set  of  continuous  functions  (not  necessarily  polynomials)  with 
the  property  ip^N  (ti)  =  5^.  Therefore  x^Nj  =  x^N(t^),  for  e  =  1,2,  ...,iVe  and 
j  =  0, 1, . . . ,  N,  and  similarly,  n(e>Nj  =  u(e)N The  relationship  between  the  physical 
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time  domain,  t  G  [t0,  tf]  =  t^\  ,  and  the  computational  space,  £  e  [—1, 1],  is  given 

by  [44] 


£ 


2 

A  Ae) 


1  and  =  A  ,  ,  dt, 
A  Ae) 


and  conversely, 

At(e)  ,  n  At^e^ 

t  =  — (f  +  1)  +  4e)  and  dt  = 

where  A  =  tff  —  t^  is  the  size  of  each  element,  Qe,  which  can  be  nonuniform  in  length. 

The  Lagrange  polynomial  defined  on  the  LGL  computational  domain  is  given  by 

j= 0^ 

j¥=i 

The  state  trajectory,  x,  can  now  be  approximated  inside  each  element,  Oe,  by 

N 

*<'>wK)  =  X>f  K)*<e)NJ, 

3=0 

where  {<$(€)}  f=o  are  the  Lagrange  polynomials  defined  on  the  LGL  grid.  Likewise,  uN(£) 
is  given  by 

N 

u<*>k(o = x;^k)scw, 

J=0 

where  V;jv(6)  —  <% .  In  the  computational  domain,  A  the  system  becomes 
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for  e  =  1,  2, . . . ,  Ne  and  i  =  0, 1, . . . ,  N.  Integration  by  parts  on  the  first  term  yields  the 
weak  form  relationship 


dfl 


N 


di 


x{e)N  d£+  [0f  x(e)JV]^ 


A 

2 


i: 


0, 


and  in  terms  of  our  approximating  polynomials,  we  have 


N 

J  i  d£ 

j=oJ~ 1  s 


1  N  ,  A  j.(e)  r  l 

#i  4>fdi  +  V  [^f  ]  j  x<’>  -  «“>")#  =  0. 


i=o 


f-i 


Remark  5.5.  With  the  discontinuous  element-based  Galerkin  approach,  we  let  x,  u  and  the 
basis  functions  be  discontinuous  across  element  edges.  A  numerical  flux  term  x(*)  acts  as 
a  jump  condition  between  elements  [44].  Here,  we  consider  the  centered  flux  relationship, 
x W  =  |  ( x ^  +  x^),  proposed  by  Delfour  et  al.  [66],  where  e  and  q  denote  the  element 
and  its  neighbor,  respectively. 


Integrating  by  parts,  yet  again,  results  in  the  Galerkin  strong  form  relationship 
A  /■!  A+(e)  A 

£/  +  <tfnxW,uW)dt;  =  o. 

j= o ^_1  ^  ^_1 


for  e  =  1,2 , ,Ne  and  i  =  0, 1, . . . ,  N.  Since  LGL  nodes  are  used,  the  boundary  term, 
rj(e\  may  be  simplified  as 


j  |  (x^N0  -  x^NN)  ,  i  —  N, 

(o,  iflN, 


1  ^(Ne)N0  _  ^{Ne~ 

0, 


-1)NN^ 


i  =  0, 
i  f  0, 
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for  elements  f2e  =  fix  and  Q,ve,  respectively,  and  for  each  other  element  (Qe  ^  Q  | .  Q  Vf  ) 
we  have 


(e)  . 

Vi  =  < 


1  (rj.(e)N 0  _  7f.(e- 


Xy 


0, 


-1  )NN^ 
)NN j 


(e 


i  =  0, 
i  =  N, 
i  f  0,  N. 


Remark  5.6.  The  problem ’s  endpoint  conditions  have  not  been  introduced  into  the  bound¬ 
ary  term,  rj<e>,  and  will  instead  be  enforce  in  a  strong  sense  through  a  set  of  endpoint 
constraints,  as  done  in  GOCM-S.  If  instead  BCs  are  to  be  enforced  weakly,  a  modification 
can  be  made  to  the  boundary  condition  term,  rj^e\ 


In  matrix-vector  notation,  our  system  may  be  expressed  as 


N 

^  -  c*(6)  =  °>  e  =  1,  2, . . . ,  Ne,i  =  0, 1, . . . ,  TV, 

3=0 


where  the  local  element  (N  +  1)  x  (N  +  1)  Galerkin  differentiation  matrix,  I)ie),  is  the 
same  as  that  defined  in  Equation  (4.3).  If  Q  —  N  LGL  quadrature  nodes  are  used,  the 
approximation  to  the  (N  +  1)  x  1  RHS  vector  simplifies  to 


-(e)  TVi 


A 


f(x^Ni,u^Ni)Wi,e=l,2r....,Ne, 


0,1 


where  the  size  of  each  element,  At(p:K  can  be  nonuniform  in  length.  The  dynamical  con¬ 
straint  becomes 


N 

E 

3=0 


D^x{e)Nj 


-(e)  JVi 


<5,  e  =  1,2, . . .  ,jVe,  i  =  0, 1, . . . ,  N. 
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The  endpoint  conditions  and  path  constraints  are  approximated  by 


elx0  >  XN  > 


<£, 


Kxt\  uie))  <  S  ■  !,  e  =  l,2,...,Ne,  i  =  0,l,...,N, 


Lastly,  the  cost  functional  J[x(-),u(-)]  is  approximated  by  LGL  quadrature  rule, 

J[x(-),  «(•)]«  JN(xN,nN)  =  jW™), 

e=l  fc=0 


where 


XN=[x^N0,.. 

=(1)JVJV 

•  >  x  )  •  ■ 

^(Ne)N0 

•  i  x  )  •  ■ 

-(Are)AWj  , 

uN  =  [u^N0,.. 

.,u^N0,.. 

To  allow  for  a  practical  search  area  for  the  optimal  solution  the  following  constraints  are 
included:  xN  £  X  and  uN  £  U,  where  X  and  U  are  the  search  regions  that  contain  the 
optimal  solution  of  the  discretized  nonlinear  optimization. 


5.3.2.  Computation  Strategy 

The  computational  strategy  of  the  GOCM-DG  is  to  find  the  feasible  solution  xN  £ 
X  and  uN  £  U  that  minimizes 


Ne  A  J.(e)  ^ 

JN(xN,uN)  =  £  —  J2F^{e)Ni^e)N>i  +  E(x^N0,x^NN), 


e=l 


i=0 
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subject  to  the  Galerkin  constraints 


N 


£4 


(6)  X^Nj  —  r/e)  — 


Vi  -  c ' 


3= 0 


<^,  e  =  l,2,...,iVe,  i  =  0,l,...,7V, 


|e(^(1)iV0^(ive)iViV)||oo<5iVj 

/r(x(e)iV\  u{e)Ni )  <  ^  •  1,  e  =  1, 2, . . . ,  Ne,  i  =  0, 1, . . . ,  N, 


where  5N  is  the  feasibility  tolerance,  which  is  dependent  upon  N. 

5.4.  Galerkin  Optimal  Control  for  Multi-scale  Problems 

In  this  section,  a  multi-scale  Galerkin  optimal  control  approach  is  proposed  to  solve 
a  specific  optimal  control  problem,  one  in  which  the  system  dynamics  are  of  different 
timescales  (see  Problem  B).  Such  a  problem  may  consists  of  a  fast  state,  Xf(t),  associated 
with  the  fast  dynamics  and  a  slow  state,  xs(t),  associated  with  the  slow  dynamics.  GOCM- 
S  (see  Chapter  4)  serves  as  the  basis  for  the  Galerkin  multi-scale  approach  to  Problem  B. 
In  particular,  system  (4.6)  is  fundamental  to  this  alternative  Galerkin  optimal  control  for¬ 
mulation  described  below.  The  outline  shown  here  follows  the  strategy  given  by  Gong  et 
al.  in  [71]. 

5.4.1.  Method  for  Approximation 

The  states  and  controls  are  approximated  with  globally  interpolating  Lagrange  poly¬ 
nomials  on  different  LGL  timescales.  The  slow  state,  xs(t),  is  approximated  on  sparse  grid 
{Tj}jL0  while  the  fast  state,  Xf(t),  on  dense  grid  {tj}jL0,  where  M  <  N.  The  slow  and 
fast  states  are  defined  by  the  following  approximating  polynomials 

M 

Xs(t)  ~  xf{t)  =  'Yjjtf  (t)xf3, 

3=0 

N 

Xf(t )  ~  xf  (■ t )  = 

3=0 
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where  the  Lagrange  polynomials  {fif1  (t)}^  and  {(j>f  (t)}f=0  are  defined  on  grids  {rj}jf0 
and  {tj}f=0,  respectively.  Let 


xf- 1  «  xs{Tj),  j  —  0,1, . . . ,  M, 
xfJ  «  Xf(tj).  j  -  0. 1,  - ,  • ,  .V. 

and  similarly,  u,Vj  ~  for  j  =  0,1,...,  iV. 

Remark  5.7.  For  simplicity,  the  control  variable,  u(t),  is  approximated  on  the  dense  grid 
{L}f=0,  however  this  need  not  be  the  case.  Modifications  may  be  made  to  the  process 
outlined  in  this  section  to  cast  the  control  onto  a  unique  grid,  such  as  sparse  grid  {fk}lL0, 
where  M  <  N  [71]. 


For  GOCM-MS,  a  solution  to  the  differential  equations  xs  =  f(xs,Xf,u)  and  Xf  = 
g(xs,Xf,u )  may  be  approximated  by  discretizing  the  slow  dynamics  over  the  dense  grid 
with  the  following  formulation 


M 

-  cf 

3=0 


N 

E 

3=0 


D™xfj 


- Ni 


0,  i  =  0, 

0,  i  —  0, 1, . . . ,  N. 


The  (N  +  1)  x  (M  +  1),  non-square ,  differentiation  transformation  matrix  DNM  and  (N  + 
1)  x  (N  +  1),  square ,  differentiation  matrix  DNN  are  defined  by 


F)NM  _ 

U ij  ~ 


dnn  - 
U ij 


'  I 

r  1 


dt 

,d(j)N 


d(bM 

dt  =  —r-(ti)wi  =  AfMWi,  i  =  0, 1, . . . ,  N,  j  =  0, 1, . 


lN^J  j+_^3 


dt 
dfif 


M, 


aN  _J_3_  dt  =  _ 

_ll  dt  dt 


(ti)wi  =  AfN  Wi,  i,j  =  0,1, 


N, 
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where  {wi}^L0  are  the  LGL  weights  associated  with  LGL  points  {L};=0,  ANN  is  the 
standard  (N  +  1)  x  (N  +  1)  Legendre  PS  differentiation  matrix  (2.57)  and  ANM  is  the 
(N  +  1)  x  (M  +  1)  Legendre  PS  differentiation  transformation  matrix  (2.39).  The  RHS 
vectors,  cf  and  &f,  are  defined  by 


cf  =  /(£f ,  xf ,  =  0,  i  =  0, 1, . . . ,  TV, 

=  g(x^\  x uNl)w.i  =  0,  i  —  0, 1, . . . ,  N. 


The  slow  state  approximation  projected  to  the  dense  grid,  may  be  calculated  by  the 
linear  mapping  T^M  =  {tj)  with  the  relationship 


n 

%Ni  =  S^TNM-Mj 

S  /  j  IJ  s  ’ 

3=0 


for  %  —  0, 1, . . . ,  N,  where  TNM  is  the  (N  +  1)  x  (M  +  1)  transformation  matrix  (2.37). 

Remark  5.8.  Projecting  the  slow  dynamics  onto  the  dense  grid  provides  a  way  of  capturing 
the  high  frequency  information  of  the  fast  state.  If  instead  the  intuitive  approach  is  used,  of 
discretizing  the  slow  dynamics  over  the  sparse  grid,  the  high  frequency  information  of  the 
fast  state  is  lost,  resulting  in  a  decrease  in  method  accuracy  [71]. 

The  dynamical  constraints  therefore  become 


M 

-Ni 

3=0 

N 

Z.DAA’ 

-Ni 

~  Cf 

3=0 

i  —  0, 1, . . . ,  N, 
i  —  0, 1, . . . ,  N. 


The  endpoint  conditions  and  path  constraints  are  approximated  similarly  by 


P(~M 0  -MM  -NO  -NN 


<  8 


N 


h(xf\  xNf\  uNl)  <  •  1,  i  =  0, 1, . . . ,  N. 
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Lastly,  the  cost  functional  J[x(-),  «(•)]  is  approximated  by  the  LGL  quadrature  rule, 


N 


JN(xf,xf,uN)  = 


y ^F(xf\  x fl,  uNl)wi  +  E(x 

i= 0 


-MO  -MM  -NO  -NN\ 
s  1  •’  s  )  •*'  f  :  •'  f  / ) 


where 


-MO  -Ml 

s  5  'Eg  i  ■ 


-MM' 

•'s  J  i 


x 7 


=  X 


NO  -NI 


>  Xf 


~NN- 

’Xf  J 


and  ma  =  ?iAM , 


,u 


NN~\ 


To  allow  for  a  search  area  for  the  optimal  solution  the  following  constraints  are  included: 
xf  g  xa,  xf  G  X  f  and  uN  G  U,  where  Xs,  X  f  and  U  are  the  search  regions  that 
contain  the  optimal  solution  of  the  discretized  nonlinear  optimization. 


5.4.2.  Computation  Strategy 

The  computational  strategy  of  the  GOCM-MS  is  to  find  the  feasible  solution  xf  G 
Xa,xf  G  X  f  and  uN  G  U  that  minimizes 


N 


N/-M  —  N  -IV 


JN(x 


Xf,U  )  = 


i= 0 


X 


Ni  ~Ni  -Ni 


,  X 


f 


U 


)w. 


+  E(x 


MO 

s  - 


MM  -NO  ~NN 


,  X 


f  ’ Xf 


subject  to  the  Galerkin  constraints 


M 

-Ni 

3=0 

N 

-Ni 

Cf 

3=0 

<<5* 


<SN, 


e(xf°,xfM,xf°,xfN)\\oo<6N, 


i  —  0, 1, . . . ,  N, 


K 


+Ni  -Ni  -Ni\ 
^  s  1  ^  f  J  ^ 


i  —  0,1, . . . ,  N, 


where  5N  is  the  feasibility  tolerance,  which  is  dependent  upon  N. 
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5.5.  Modifications  to  Galerkin  Optimal  Control 


Two  modifications  to  the  general  Galerkin  optimal  control  formulation  will  be  dis¬ 
cussed  in  this  section:  over-integration  of  the  RHS  vector  and  the  use  of  quadrature  points 
other  than  LGL,  such  as  LG  and  LGR  points. 

5.5.1.  Over-Integration  of  the  RHS  Vector 

Thus  far,  all  the  Galerkin  optimal  control  formulations  discussed  have  approxi¬ 
mated  the  RHS  vector  with  inexact  integration,  and  N  +  1  quadrature  points.  It  may, 
however,  be  advantageous  to  approximate  the  RHS  vector  with  increased  accuracy.  This 
may  be  accomplished  by  over-integration  of  the  RHS  vector  while  approximating  the  in¬ 
tegral  using  LGL  quadrature.  As  outlined  in  the  GOCM-S,  the  state  trajectory,  x(t),  is 
approximated  by 

N 

x(t)  «  XN(t)  =  (t)xNj. 

3=0 

Let 


xNj  «  x(tj),  j  =  0,l,...,N, 

and  similarly,  vN3  is  the  approximation  of  u(tj).  In  the  Galerkin  optimal  control  formu¬ 
lation,  a  solution  to  the  differential  equation  x  —  f(x,  u)  =  0  may  be  approximated  at  the 
LGL  nodes  with  the  following  weak  integral  formulation  [44] 

dt  =  °> 

and  in  terms  of  the  approximating  polynomials  becomes 

[  ^~dTdt^Nj  ~  l  $ f(.xN ,uN)dt  =  °- 

j  ii  •'  1  •'  1 
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In  matrix-vector  form  the  system  becomes 

N 

YPijXNj  —  °i  =  0,  i  =  0, 1, . . . ,  N, 

3=0 

where  the  (N  +  1)  x  (Ar  +  1)  differentiation  matrix  D  is  defined  by  Equation  (4.2)  and 
calculated  exactly  by  Equation  (4.3).  The  (N  +  1)  x  1  RHS  vector  c  is  defined  as 

Ci  =  J  </>?{t)f{xN{t),uN{t))dt,  i  =  0, 1,...,N, 

and  can  be  approximated  with  LGL  quadrature  by  the  relationship 

Q 

^  «  cNl  =  Yffl(tk)f(xN(tk),uN(tk))wk,  i  —  0,1, ,  N, 
k= o 

where  {wk}Y  are  the  LGL  quadrature  weights  (2.49)  associated  with  LGL  points,  (ffc}^=0. 

Remark  5.9.  Recall  that  for  LGL  quadrature  rule,  Q  =  N  +  1  integration  points  will 
integrate  the  RHS  vector  exactly  when  f(x(t),u(t ))  is  linear  in  x ( t )  and  u(t).  In  the  case 
of  a  nonlinear  function,  f,  the  accuracy  of  the  numerical  integration  of  the  RHS  vector  may 
also  be  improved  ifQ  =  N  +  l  integration  points  are  used  when  f  consists  of  one  or  more 
nonlinear  terms. 

Using  over-integration,  the  RHS  vector  approximation  becomes 

JV+l 

cNl  =  uN(tk))wk,  i  =  o,  1, . . . ,  N, 

k= 0 

where  are  the  LGL  quadrature  weights  associated  with  LGL  points,  {U-}/^'  -The 

dynamical  constraint  therefore  becomes 

N 

YDipNj-cNi  <SN,  i  =  0,1,...,  N. 
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The  endpoint  conditions  and  path  constraints  are  approximated  similarly  by 

h(xNi,um)  <SN  ■  1,  i  =  0,1,...,  N, 

where  1  denotes  [1, . . . ,  1]T .  Lastly,  the  cost  functional  J  [&(•),  «(•)]  is  approximated  by  the 
LGL  quadrature  rule, 

N 

J[z(-)X-)]  ~  JN(xN,uN)  =  J2F(xN\um)wi  +  E(xm,xNN), 

i= 0 

where  xN  =  [x^0  xN1  ■  ■  ■  xNN ]  and  uN  =  [uN0  uN1  ■  ■  ■  uNN] .  To  allow  for  a  practical 
search  area  for  the  optimal  solution  the  following  constraints  are  included:  xN  G  X  and 

uN  e  U. 

5.5.2.  Galerkin  Optimal  Control  with  LG  and  LGR/F-LGR  Quadrature  Points 

The  Galerkin  weak  formulation  with  weak  boundary  condition  enforcement  also 
allows  for  consideration  of  quadrature  points  that  do  not  include  both  endpoints,  e.g.,  LG 
and  LGR  (or  F-LGR)  nodes.  This  is  advantageous  since  LG  and  LGR  quadrature  rules  may 
lead  to  increased  accuracies  when  performing  the  RHS  vector  integration.  Recall  that  LG 
quadrature  rule  integration  is  exact  for  polynomial  integrands  of  degree  less  than  or  equal 
to  2N  +  1,  where  the  LG  nodes,  {4}^=0,  are  defined  by  —  1  <  t0  <  ■  ■  ■  <  tN  <  1  and 
are  the  roots  of  Equation  (2.41).  LGR  (and  F-LGR)  quadrature  rule  is  exact  for  polynomial 
integrands  of  degree  less  than  or  equal  to  2N,  where  the  LGR  nodes,  are  defined 

by  t0  =  -1  <  t\  <  ■  ■  ■  <  Lv  <  1,  and  are  the  roots  of  Equation  (2.43)  and  the  F-LGR 
nodes  are  the  negative  of  the  LGR  points. 
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5.5.2. 1.  Method  for  Approximation  for  Galerkin  Optimal  Control  with  LGR/F-LGR 
Nodes 

In  this  section,  a  formulation  is  proposed  to  solve  a  specific  optimal  control  prob¬ 
lem,  one  in  which  the  initial  or  final  conditions  of  the  problem  dynamics  are  provided  (not 
both).  Consider  Problem  B  such  that  one  of  the  two  following  cases  exist: 

Case  1:  e(x(— 1), x(l))  =  x(— 1)  —  x°  =  0, 

Case  2:  e(x(— 1),  x(l))  =  x{l)  —  xf  =  0, 

where  a;0  and  xJ  are  constants.  As  with  the  GOCM-W,  we  will  consider  the  weak  enforce¬ 
ment  of  the  end  conditions,  however,  now  we  will  use  the  Galerkin  weak  form.  In  this 
approximation  to  Problem  B,  the  state  trajectory,  x(t),  is  approximated  with  globally  inter¬ 
polating  iV-th  order  Lagrange  polynomials,  {©f  }jLo,  defined  on  a  grid  of  F-LGR  or  LGR 
nodes,  {tj}jL0,  for  Case  1  and  2,  respectively, 

N 

x(t)  ~  XN(t)  =  (t)xNj . 

3=0 

Due  to  the  property  of  the  Lagrange  polynomials,  (p™  {U)  =  8ij,  we  have 

xNj=xN(tj),  j  =  0,l,...iN. 

Also,  let  uN(t)  be  an  interpolating  function  of  {uNj}jL0, 

N 

uN(t)  = 

j= o 

where  {ip^}f=0  is  any  set  of  continuous  functions  with  the  property  =  dtJ.  Taking 

the  weak  integral  form  of  x  —  f(x,  u)  =  0  yields  [44] 

~  dt  =  °> 
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for  i  —  0, 1, . . . ,  N.  Integration  by  parts  on  the  first  term  results  in  Galerkin  weak  form, 


d(j). 


i- 1 


dt 


xNdt+[^xN]l_1 


~  <j)?f(xN,uN)dt  =  0. 


i 


In  terms  of  our  approximating  polynomials,  we  have 


Ylj  ^~^idtxNj  ~  ^ (^1)xN (-1)  +  0f(1)a;JV(1)  -  J  (t>i  f(xN ,uN)dt  =  0, 


for  i  =  0,1 , ,N.  The  expression  may  be  simplified  as 


y ^pijXNj  +  hi  -  cNl  =  0, 
i=o 

for  each  i  —  0, 1, . . . ,  N,  where 

l)x°,  i  ±  N, 

l)a:0  +  xNN,  i  —  N, 

for  Case  1  and 

j  <J)q  (l)x^  —  xN0, 

for  Case  2. 

The  (N  +  1)  x  (N  +  1)  differentiation  matrix  D  is  defined  by 
Dtj  =  -  j'  =  0, 1, ...  ,7V. 


1  =  0, 
i  ±  0, 
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If  F-LGR/LGR  quadrature  rule  is  used  with  Q  =  N  quadrature  points,  the  differentiation 
matrix,  D,  can  be  calculated  exactly  by  the  relationship 


ij  ^  dt 

k= 0 


d<Pi 


N 


dt 


(t 


Dwi 


=  -Au<rJ-' 


i,j  =  0,1 


where  {wj}jL0  are  the  F-LGR/LGR  quadrature  weights  (2.50)  and  AtJ  =  is  the 

F-LGR/LGR  PS  differentiation  matrix. 

The  RHS  vector,  c  can  be  approximated  by  the  relationship 

Q 

Ci  ~  cNl  =  jtk)f(xN(tk),  uN(tk))wk  =  f(xNl,um)wi,  i  =  0, 1, . . . ,  N. 

k= 0 

Remark  5.10.  Recall  that  for  F-LGR/LGR  quadrature  rule,  integration  is  exact  for  polyno¬ 
mial  integrands  of  degree  less  than  or  equcd  to  2N.  If  Q  =  N  integration  points  are  used, 
the  RHS  vector  will  integrate  exactly  when  f(x(t),u(t ))  is  linear  in  x(t)  and  u(t ).  In  the 
case  of  a  nonlinear  function  f,  the  accuracy  of  integration  may  be  improved  by  increasing 
the  number  of  quadrature  points  Q  (See  Section  2.2.2). 

The  dynamical  constraint  becomes 


N 

y^DijxNj  +  hi-  cNl 


3=0 


<  S 


N 


i  =  0, 1, . . . ,  N. 


The  path  constraints  are  approximated  by 


h(xN\uNi)  <5N  ■  1,  i  —  0,1, . . . ,  N. 


Lastly,  the  cost  functional  J[x(-),u(-)]  is  approximated  by  the  F-LGR/LGR  quadrature 
rule, 

N 

J[xf),u(-)}  «  JN(xN,uN)  =  J2F(xN\uNi)wi  +  E(xm,xNN), 

i= 0 
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where  xN  =  [x^0  xN  1  •  •  •  xWff]  and  uN  =  [u™  uN1  ■  ■  ■  uNN] .  To  allow  for  a  practical 
search  area  for  the  optimal  solution  the  following  constraints  are  included:  xN  G  X  and 

uN  e  U. 

Remark  5.11.  Note  that  the  use  of  the  Galerkin  optimal  control  formulation  outlined  here 
with  LGR  or  F-LGR  nodes  does  not  automatically  provide  the  optimal  control  solutions  at 
one  endpoint  of  the  domain  (this  applies  to  t  =  1  for  LGR  points  and  t  =  —  1  for  F-LGR 
points).  Flow  ever,  if  required  by  the  application,  the  control  solutions  at  t  =  1  or  t  =  —  1 
may  be  found  by  interpolation  of  the  control  approximation,  uN  with  a  possible  reduction 
in  accuracy. 

5.5.2.2.  Method  for  Approximation  for  Galerkin  Optimal  Control  with  LG  Nodes 

In  this  section,  a  formulation  is  proposed  to  solve  a  specific  optimal  control  prob¬ 
lem,  one  in  which  a  complete  set  of  boundary  conditions  are  provided  for  the  problem 
dynamics.  Consider  Problem  B  such  that 

e(x(— 1),  x(l))  =  [x(— 1)  —  x0,x(l)  —  x*]  1  =  [0,  Of  , 

where  x°  and  x?  are  constants.  As  with  the  GOCM-W,  we  will  consider  the  weak  enforce¬ 
ment  of  the  boundary  conditions,  however,  now  we  will  use  the  Galerkin  weak  form.  In 
this  approximation  to  Problem  B,  the  state  trajectory,  x(t),  is  approximated  with  globally 
interpolating  iV-th  order  Lagrange  polynomials,{0jv}  A=0,  defined  on  a  grid  of  LG  nodes, 

MU 

N 

x(t)  ~  xN(t)  =  ^2ff(t)xNF 

3=0 

Due  to  the  property  of  the  Lagrange  polynomials,  fft,;)  =  dtJ,  we  have 

xNj  =  xN(tj ),  .7  =  0,1,...*#. 
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Also,  let  uN(t)  be  an  interpolating  function  of  {uNj}^=  0, 

N 

uN(t)  =  ^il)f{t)uN\ 

3=0 

where  {'0jV}jLo  is  any  set  of  continuous  functions  (not  necessarily  polynomials)  with  the 
property  ^(U)  =  d%J.  Taking  the  weak  integral  form  of  x  —  f{x,  u)  =  0  yields  [44] 

{-Jp-  ~  dt  =  0, 

for  i  —  0, 1, . . . ,  N.  Integration  by  parts  on  the  first  term  results  in  Galerkin  weak  form. 


In  terms  of  our  approximating  polynomials  (and  introducing  the  true  initial  condition, 
xN(— 1)  — y  x{— 1)  and  xN(l)  — >  a;(l))  we  have 

~dir^dt^Nj  -^(_1)a;(_1) + (i)x(i)  -  f  <tfrf(xN,uN)dt  =  o, 

for  i  =  0,1,...,  N.  By  letting  x(— 1)  =  x°  and  x(l)  =  xf ,  the  expression  may  be 
simplified  as 

N 

DijXNi  +  Ki  —  cm  =  0, 

3=0 

for  each  i  —  0, 1, . . . ,  N,  where 

«*  =  (-1)®°  +  (1)^. 
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The  (N  +  1)  x  (N  +  1)  differentiation  matrix,  I),  is  defined  by 


Dij  =  - 


'-1 


d4i{t) 

dt 


</>f( t)dt ,  =  0, 


If  LG  quadrature  rule  is  used  with  Q  =  N  quadrature  points,  the  differentiation  matrix,  D, 
can  be  calculated  exactly  by  the  relationship 


ij  ^  dt 

k=0 


N 


dt 


(t 


.jjWj 


= 


i,j  =  0,1 


where  are  the  LG  quadrature  weights  (2.48)  and  A%3  =  <i)f(t,)  is  the  LG  PS 

differentiation  matrix. 

The  RHS  vector,  c  can  be  approximated  by  the  relationship 

Q 

Ci  «  cNl  =  y^^(tk)f(xN(tk),uN(tk))wk  =  f(xNt,uNl)wi,  i  =  0, 1, . . . ,  N. 

k= 0 

Remark  5.12.  Recall  that  for  LG  quadrature  rule,  integration  is  exact  for  polynomial  in¬ 
tegrands  of  degree  less  than  or  equal  to  2N  +  1.  If  0  =  N  integration  points  are  used,  the 
RHS  vector  will  integrate  exactly  when  f(x(t),u(t))  is  linear  in  x(t)  and  u{t).  In  the  case 
of  a  nonlinear  function  f,  the  accuracy  of  integration  may  be  improved  by  increasing  the 
number  of  quadrature  points  Q  (See  Section  2.2.2). 


The  dynamical  constraint  becomes 


N 


y ^pijxNj  +  ki  —  cNl 

3=0 


<  6 


N 


i  —  0, 1, . . . ,  N. 


The  path  constraints  are  approximated  by 


h(xN\um)<5N  ■  1,  i  =  0,l,...,N. 
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Lastly,  the  cost  functional  J[x(-),u(-)]  is  approximated  by  the  LG  quadrature  rule, 


N 

J[x(-)X-)]  ~  JN(xN,uN )  =  J2F(xm,uNi)wi  +  E(xno,xnn), 

i=0 

where  xN  =  [x^0  xN  1  •  •  •  and  uN  =  [uwo  m^1  •  •  •  uNN] .  To  allow  for  a  practical 
search  area  for  the  optimal  solution  the  following  constraints  are  included:  xN  G  X  and 

uN  G  U. 

Remark  5.13.  Note  that  the  use  of  the  Galerkin  optimal  control  formulation  outlined  here 
with  LG  nodes  does  not  automatically  provide  the  optimal  control  solutions  at  the  endpoints 
of  the  domain,  t  =  ±1.  However,  if  required  by  the  application,  the  control  solutions  at 
t  =  —  1  and/or  t  =  1  may  be  found  by  interpolation  of  the  control  approximation,  uN  with 
a  possible  reduction  in  accuracy. 
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CHAPTER  6: 

GALERKIN  OPTIMAL  CONTROL  WITH  LEGENDRE 
POLYNOMIAL  TEST  FUNCTIONS 


In  this  chapter,  a  Galerkin  optimal  control  formulation  is  proposed  where  Legen¬ 
dre  polynomials  replace  Lagrange  polynomials  as  test  functions  in  the  weak  integral  ap¬ 
proximation  of  the  problem  dynamics.  The  purpose  of  this  modification  is  highlighted  in 
consistency  Theorem  6.2,  which  is  valid  for  problems  with  discontinuous  controls  (unlike 
consistency  Theorems  4.3  and  4.4  for  Problems  GOCM-S  and  GOCM-S,  respectively). 
Theorem  6.2  proves  that  the  nonlinear  programming  Problems  GOCM-L  is  a  consistent 
approximation  to  the  continuous  optimal  control  Problem  B,  even  those  with  piecewise 
continuous  controls. 

6.1.  Methods  for  Approximation 

In  the  Galerkin  optimal  control  approximation  to  Problem  B,  with  Legendre  poly¬ 
nomial  test  functions,  the  state  trajectory,  x(t),  is  approximated  with  globally  interpolating 
N-th  order  Lagrange  polynomials,  {(j)^}f=0,  defined  on  a  grid  of  LGL  nodes,  {tj}f=0, 

N 

x(t) « xN(t)  =  y (■ t)xNj . 

3=0 

Due  to  the  property  of  the  Lagrange  polynomials,  =  dij,  we  have 

xNj=xN(tj),  j  =  0,l,...,N. 

Also,  let  uN(t)  be  an  interpolating  function  of  {uNj}jL0, 

N 

uN(t)  = 

3=0 
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where  {^}f=0  is  any  set  of  continuous  functions  (not  necessarily  polynomials)  with  the 
property  ifrf  ( ti )  =  dtj .  In  this  formulation,  a  solution  to  the  differential  equation  x  — 
f(x,u )  =  0  may  be  approximated  at  the  LGL  nodes  with  the  following  weak  integral 
formulation 

/  ~  f  dt  =  °> 

for  i  =  0,1 ,N,  where  the  test  functions,  L,  =  fL;  ^ .  are  the  normalized  Legendre 
polynomials  of  order  i.  The  L2-norm  of  Lt  is  given  by  [72]  as 

~~2 
2i  +  l 

In  terms  of  the  approximating  polynomials,  the  system  becomes 

_N ^  r 1  _  r 1 

^  /  Li—jj-dtxNj—  I  Lif(xN,uN)dt  =  0. 
j= o  ^_1  ' 

In  matrix-vector  form,  the  system  becomes 

N 

-  c'i  ~  0-  *  =  0,1,...  ,N, 

3=0 

where  the  (N  +  1)  x  (N  +  1)  differentiation  matrix  is  defined  by 
D%  =  j  Li(t)-^dt,  i.j-  0.1,...,. V, 
and  the  (iV  +  1)  x  1  right-hand- side  (RHS)  vector  c  is  defined  as 

ci  =  [  Li(t)f(xN(t),uN(t))dt,  i  =  0,1, ...,7V. 
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If  LGL  quadrature  rule  is  used,  with  Q  =  N  quadrature  points,  the  differentiation  matrix, 
Dl,  can  be  calculated  exactly  by  the  relationship 


_  am- 

Dij  =22Li(tk)—^-(tk)wk,  i,j  =  0,1,.,., TV. 


If  Q  =  N  quadrature  points  are  used,  the  RHS  vector,  cL,  can  be  approximated  by  the 
relationship 


nL  ^  -Ni  _ 
ci  ~  CL  — 


1  =yMtk)f(xNk,  uNk)wk ,  *  =  0, 1,,* . . ,  N. 


The  system  may  be  simplified  as 


=  *  =  0, 1,  — ,N. 


The  dynamical  constraint  therefore  becomes 


<SN,  i  =  0,l,... 


The  endpoint  conditions  and  path  constraints  are  approximated  similarly  by 


e(xN0,  xNN)  <<5", 


h(xNi,uNi)  <6N  ■  1,  i  =  0,1,...,  N. 


Lastly,  the  cost  functional  J[x(-),u(-)}  is  approximated  by  the  LGL  quadrature  rule, 


J[x(-)X-)]  ~  JN(xN,uN)  =  yF(xm,um)wi  +  E(xno,xnn), 
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where  xN  =  [x^0  xNl  ■  ■  ■  x^^]  and  uN  =  [uwo  uN1  ■  ■  ■  -u^^j .  To  allow  for  a  practical 
search  area  for  the  optimal  solution  the  following  constraints  are  included:  xN  G  X  and 
uN  G  U,  where  X  and  U  are  the  search  regions  that  contain  the  optimal  solution  of  the 
discretized  nonlinear  optimization. 

6.2.  Computation  Strategy 

The  computation  strategy  for  Galerkin  optimal  control  with  Legendre  polynomial 
test  functions  is  presented  in  two  forms.  First,  the  strategy  for  the  continuous  problem, 
in  terms  of  the  approximating  polynomials  is  outlined,  denoted  as  GOCM-L.  Next,  the 
discrete  problem,  discretized  on  a  LGL  grid  is  presented,  denoted  as  GOCM-L. 

6.2.1.  Computation  Strategy  for  GOCM-L 

The  computational  strategy  of  the  GOCM-L  is  to  find  the  feasible  solution  xN(t)  G 
X  and  uN(t)  G  U  for  the  following  cases: 

Case  1.  u(-)  is  piecewise  C°  and  x(-)  G  C°  and  piecewise  C1, 

Case  2.  «(•),  x(-)  G  Hm~1  and  m  >  2, 
that  minimizes 

J(xN(-),u(-))  =  I  F  (xN(t),u(t))dt  + E  (xN(^.l),xN(l))  > 

subject  to  the  Galerkin  constraints 

<MN~a ,  i  =  0,1,...,  iV, 

<  MN~a , 

<  MN~a , 


L 


Li(t )  {iN(t)  -  f  (xN(t),u(t)))  dt 

||e  (xN (—1) ,  xN (l)) 
|| h+  (xN(t),uN(t))\ 


L2 
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where  a  —  \  and  (m  —  1),  for  Case  1  and  2,  respectively;  M  is  a  constant  independent  of 
N  and 


{h,  h  >  0, 
0,  h  <  0. 


6.2.2.  Computation  Strategy  for  GOCM-L 

The  computational  strategy  of  the  GOCM-L  is  to  find  the  feasible  solution  xN(t)  G 
X  and  uN (t )  G  U  that  minimizes 


N 


JN  (xN,  uN)  =  (xn\  um )  Wi  +  E  (: xN0 ,  xNN)  , 


i= 0 


subject  to  the  Galerkin  constraints 


N 


LxNj  -  cm 


3= 0 


<SN ,  i  =  0,l,...,N, 


\e  (xN0,  xNN^j  L  <  8N , 

h(ccNi,um)  <SN  ■  1,  i  =  0,l,...,7V. 


6.3.  Feasibility  of  Solutions 

Theorem  6.1  (Feasibility  of  GOCM-L).  Given  any  feasible  solution  t  H >  (x,u),  for  Prob¬ 
lem  B,  consider  the  following  two  cases: 

Case  1.  u(-)  is  piecewise  C°  and  x(-)  G  C°  and  piecewise  C1, 

Case  2.  u(-),  x(-)  G  Hm~l  and  m  >  2. 

Then,  there  exists  a  positive  integer  N0  such  that,  for  any  N  >  N0,  GOCM-L  has  a  poly¬ 
nomial  feasible  solution,  (xN(t),  uN(t))  such  that 


x(t)  ~xN{t)  \\l2  <  MN~a  and  \\u(t)  -  uN(t)  \\l2  <  MN~a, 
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where  a  —  \  and  (m  —  1),  for  Case  1  and  2,  respectively;  and  M  is  a  positive  constant 
independent  of  N. 

Proof.  Let  pit)  be  the  (N  —  l)-th  order  truncated  Legendre  polynomial  approximation  of 
x(t).  By  Lemmas  4.1  and  4.2  there  is  a  constant  cq  independent  of  N,  for  any  N  >  N0, 
such  that 


\%(t)  ~P(t) ||L2  <  c0N  ", 


where  a  —  \  and  (m  —  1),  for  Case  1  and  2,  respectively.  Define 


x  (t)  =  I  p(s)ds  +  x(— 1). 


Then  p(t)  =  xN{t)  and 


x(t)  -  xN(t)\\L2  <  2 c0N  ", 


since,  from  Holder’s  inequality  (Lemma  4.3),  we  have 


x(t)  —  xN(t )  = 


(x(s)  —  p(s))  ds 


'-i 


<  /  \x(s)  —  p(s)\ds 


'-i 


<a/2  (  /  |i(s)  ~p(s)\2ds  )  =  V2\\x{t)  -  p{t)\\L2  <  s/2cqN 


(6.1) 


Let  uN  it)  be  the  N-th  order  Legendre  polynomial  so  that 


u(t)  -  w  (t)  L2  <  ciN  ". 
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From  our  Galerkin  approximation,  Holder’s  inequality  (Lemma  4.3),  and  the  property, 
\\Li\y  =  \J we  have  for  each  %  —  0, 1, . . . ,  N, 


< 


r 

-i 

Li(t) 


Li{t )  ( xN(t )  -  /  (xN(t),uN(t)))  dt 

-  /  {xN{t),uN(t))\\L2 


2i+l 


L2 


=  uN{t ))  ||l2 

<  || x{t)  -xNit)\\L2  +  ||/  (z(f)Xf))  -  /(xJV(i),«JV(i))||L2 

=  CqN~ 5  +  (i||x(f)  -  ^(t)^  +  (2||«(f)  -  MAr(f)||L2 
<  c0N~a  +  2llCoN~a  +  l2ClN-Q, 

where  li  and  /2  are  the  Lipschitz  constants  of  /  with  respect  to  x  and  u,  respectively,  which 
are  independent  of  N.  It  follows  that 


i: 


Li(t)  {xN(t)  -  /  (xN(t),uN(t)))  dt 


<  MN~a, 


and  holds  for  each  i  —  0, 1, . . . ,  N,  and  all  N  >  N0,  where  M  is  a  constant  independent  of 
N. 

For  the  endpoint  condition  we  have 


x(l)  —  2:^(1)  = 


(i(s)  —  p(s))  ds 


i- i 


—  /  l^(s)  —  p(s)\ds 


'-i 


<V2(  j  |  x(s)  -  p(s)\2ds)  =  V2\\x(t) -p(t)\\L2  <  V2c0N  ", 


so  we  have,  by  Lipschitz  condition, 


e(xN  (—1),  a;  (1))  <  MN~a. 
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For  the  path  constraint  let  V  =  [t\h  (xN(t),  uN(t))  >  0}  ,  V  =  [—1,1]  \D,  since 
h(x(t),u(t ))  <  0.  Then 


< 


h+  (xN(t),uN(t))\\L2  =  ^  (h(xN(t),uN(t)))2  <Uj 
<  (h  (xN  (t),uN  (t))  —  h  (x(t),u(t)))2  dt'j 


(h(xN  (t) ,  vr  (t))  —  h(x(t),u(t)))2dt  +  /  (h(xN  (t) ,  uN  (t))  —  h(x(t),u(t)))2dt 


i v 


I V 


=  ^  J  (h  (xN(t),  uN(t))  —  h  (x(t),  u(t)))2  dt'j 

=  || h(xN(t),uN(t))  -  h(x(t),u(t))\\L2 
<  l3\\x(t )  ~xN(t)\\L2  +  l4\\ u(t)  ~uN(t) \\l2  <  MN~a, 


1 

2 


where  /3  and  l4  are  the  Lipschitz  constants  of  h  with  respect  to  x  and  u,  respectively,  which 
are  independent  of  N.  Hence 


||  h+  (xN(t),uN(t))\\L2<MN~a. 

Thus  a  solution  (xN(t),  uN(t))  to  GOCM-L  is  feasible!  □ 

6.4.  Consistency  of  Solutions 

Theorem  6.2  (Consistency  of  GOCM-L).  Suppose  (xN(t),  uN(t))  is  a  solution  ofGOCM- 
L  and  there  exists  (x(t),u(t))  such  that: 

Case  1.  u  ( • )  is  piecewise  C°  and  x  ( • )  e  C°  and  piecewise  C1, 

Case  2.  u(-),  x(-)  G  Hm~l  and  m  >  2. 

Also,  suppose 


lim  | u(t)  —  uN [t )  ,2  =  0, 

iV— >oo  L 


(6.2) 
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and  xN  (t)  — >  x{t )  uniformly,  thus 


lim  x{t )  —  xN  it)  2  =  0. 

N^oo  ^ 


(6.3) 


Then  ( x(t),u(t ))  satisfies 

( 

l|£(f)  -  f(x(t),u(t)) \\L2  =  o, 
'  e(x(-l),x(l))  =  0, 
h  (x(t),u(t))  <  0, 

and  is  an  optimal  solution  to  Problem  B. 


Proof.  Due  to  the  completeness  of  the  Legendre  polynomials  [80],  to  prove  \\x(t )  —  /  ( x ,  u) \\l2 
0  it  is  sufficient  to  prove 


Lft)  (x(t)  -  f  (x(t),u(t)))  dt 


0, 


for  each  i  =  0, 1, . . . ,  oo  (see  definition  of  completeness,  Definition  4.2).  Consider 


Ift)  {x(t)  -  f  (x(t),u(t)))  dt 


i-i 


< 


i: 


+ 


c 


Lft)  (x(t)  —  xN(t ))  dt 


Lft)  (. xN(t )  -  f  (: xN(t),uN(t )))  dt 

Li(t )  (/  (xN(t),uN(t))  -  f  (x,  u))  dt 

<  MN~a  +  || x(t)  ~xN(t)\\L2  +  || f(x(t),u(t))  -  f  {xN(t),u(t))\\L2 
=  M  N~a  +  ||a;(t)  -  xN(t)  ||i2  +  h\\x(t)  -  xN(t)\\L2  +  Z2||«(*)  - 


c 


where  a  —  \  and  (m  —  1),  for  Case  1  and  2,  respectively  (from  Theorem  6.1);  M  is  a 
positive  constant  and  l\  and  l2  are  the  Lipschitz  constants  of  /  with  respect  to  x  and  u. 
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respectively,  all  independent  of  N.  It  follows  that  as  N  — »  oo  we  have 


II z(t)  -  f(x(t),u(t)) ||L2  =  0. 

For  the  endpoint  condition,  since  xN(t)  —>  x(t)  uniformly,  we  have  xN  ( 1  j  — >  x ( 1  j 
and  xN(— 1)  — *  a; ( —  1  j .  Since,  from  the  formulation  of  the  computational  strategy  we  have 
|e  (xJV(— 1),  xiV(l))  |  <  MN~a,  we  conclude  that  e  (x(— 1),  x(l))  =  0  as  N  — y  oo. 

For  the  path  constraint,  since  h(x(t),u(t ))  is  piecewise  C1,  if  h(x(t*),u(t*))  >0, 
3  an  interval  (a,  b )  in  which  h  (x(t),u(t))  >  0.  Then 

\\h(x(t),u(t))\\L2{afi)=  ^  (h(x(t),u(t)))2  <uj  >0. 

However, 

\\h{x{t),u(t))\\L2{aM  <  II h(x(t),u(t))  -  h+  (xN(t),uN(t))\\L2{ab)  +  || h+  (xN(t),uN(t)) 
<|| h  (x(t),u(t))  -  h  (xN(t),uN(t))  ||i2(a;fe)  +  MN~a 
<  h\\x(t)  -  xN(t))\\L2  +  l4\\u(t)  -  uN(t)) \\l2  +  MN~a, 

where  l3  and  l4  are  the  Lipschitz  constants  of  h  with  respect  to  x  and  u,  respectively, 
which  are  independent  of  N.  Hence,  this  is  a  contradiction,  therefore  h  (x(t).  u(t))  <  0  as 
N  — >  oo. 

Suppose  that  (x(t),u(t))  is  not  optimal.  Then  3  (x* (t) ,  u*  (t))  so  that 

J  (>*(•),  «*(■))  <  J  (®(-)>w(0)- 
Also,  3  (x*(t),u*(t))  such  that 

|| x*N(t)  -x(t)  \\l2<  MN~q  and  \\u*N(t)  -  u(t)  \\l2<  MN~a , 


L2(a,b) 
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where  (x*N(t),  u*N(t))  is  a  feasible  trajectory  of  GOCM-L.  Therefore 

J(x*n(-),u*n(-))>J(xn(-),un(-)).  (6.4) 

However, 

|  J  [xN{-),uN(-))  -  J  (x(-),«(0)| 

<  J  | F  (xN(t),uN(t))  -  F  (x(t),u(t))\dt+  \E  (xN(-l),xN(l))  -  x(-l),  x(l))  | 

<  V2\\F  (xN(t),uN(t))  -  F  (x(t),u(t))  dt\\L2  +  \E  (xN(-l),xN(l))  —  a?(— 1), a?(l)) |. 

Due  to  the  Lipschitz  condition  and  assumptions  (6.2)  and  (6.3)  we  have 

(a;JV(')>“JV(-))  -  J W-)X-))|  =  °- 

Similarly, 

VmJJ  (x*N(-),u*N(-))  -  J  (x*(-),  «*(■))  |  =  0. 

Therefore,  from  (6.4)  we  have 

J  (x*(-),  «*(■))  >  J  (®(-)>w(0)- 

This  is  a  contradiction,  since  we  assumed 


J  (>*(•),  «*(■))  <  J  O(-)X'))- 


We  conclude  that  (x(t),u(t))  achieves  an  optimal  cost  and  therefore  is  an  optimal  solution 
to  Problem  B !  □ 
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Remark  6.1.  Theorems  6.1  and  6.2  show  that  solutions  to  the  GOCM-L  exist  and  provide 
confidence  that  they  will  converge  to  the  optimal  solutions  of  Problem  B.  More  importantly, 
Theorem  6.2  provides  a  foundation  to  show  that  solutions  to  the  GOCM-  L  will  converge  to 
optimal  solutions  with  discontinuities  in  the  control,  such  as  solutions  to  bang-bang  control 
problems.  However,  as  with  the  GOCM-S,  questions  still  remain  about  the  conditions 
under  which  the  underlying  assumptions  exist  ( in  the  case  of  the  GOCM-L,  assumptions 
(6.2)  and  (6.3)),  but  the  required  analysis  is  above  the  scope  of  this  dissertation. 
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CHAPTER  7: 

EXAMPLE  PROBLEMS 


A  number  of  examples  are  provided  in  this  chapter  to  highlight  the  versatility  and 
accuracy  of  the  Galerkin  optimal  control  formulations  discussed  in  Chapters  4—6.  Exam¬ 
ples  7.1,  7.2  and  7.4  demonstrate  the  improved  accuracy  provided  by  the  Galerkin  optimal 
control  formulation  with  weak  enforcement  of  boundary  conditions  over  strong  enforce¬ 
ment  for  problems  with  fixed  boundary  conditions  as  well  as  incomplete  sets  of  end  con¬ 
ditions.  In  particular,  the  examples  show  that  the  GOCM-W  has  an  advantage  over  the 
GOCM-S  for  low  order  approximations  of  control  solutions.  Examples  7.1  and  7.4  also 
show  the  potential  advantages  of  the  Galerkin  formulations  with  F-LGR  and  LG  points,  re¬ 
spectively.  Additionally,  Example  7.3  demonstrates  the  effectiveness  of  the  element-based 
Galerkin  optimal  control  formulations  (such  as  the  GOCM-DG)  when  employed  to  approx¬ 
imated  optimal  control  problems  with  discontinuous  controls.  Lastly,  Examples  7.5  and  7.6 
demonstrate  the  computational  efficiency  in  which  the  multi- scale  Galerkin  optimal  control 
formulation  may  solve  multi-scale  problems,  those  with  states  and  controls  that  evolve  on 
different  timescales.  In  contrast  to  the  difficulties  with  the  multi-scale  Legendre  PS  method 
(see  Section  3.4)  highlighted  in  Chapter  3,  the  GOCM-MS  is  shown  to  successfully  reduce 
the  size  of  multi-scale  problems. 

7.1.  Example  7.1:  Nonlinear  Two-Dimensional  Problem  with  Fixed 
Initial  Conditions 

Consider  the  nonlinear  two-dimensional  problem  with  fixed  initial  conditions  given 
by  Gong  et  al.  [3]  of  minimizing  the  cost  function 

J  =  4xi(2)  +  x2(2)  +  4  f  u2dt,  (7.1) 

Jo 
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subject  to  the  dynamics 


Xi(t)  =  x\  (t)  and  x2  (t)  =  u(t), 


and  with  initial  conditions 


Xi(0)  =  0  and  x2(0)  =  1. 


The  analytic  solution  to  this  problem  is  given  by 


Xi  (t) 
x2  (t) 
u(t ) 


2  64 

5  _  5(2  +  t)5’ 
4 

(2  +  tp 

-8 

(2 Ttf’ 


obtained  via  Pontryagin’s  maximum  principle.  This  problem  was  solved  using  the  GOCM- 
W  (Section  5.1)  with  optimality  and  feasibility  tolerances  of  10~8  and  10”8,  respectively. 
A  two-point  initial  guess  was  provided.  Figure  16  shows  a  comparison  of  the  exact  solution 
with  the  GOCM-W  approximation  and  =  20. 
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Figure  16:  Exact  solution  and  GOCM-W  approximation  with  N  =  20  for  Example  7. 1. 

Figures  17,  18  and  19  show  the  maximum  error  of  the  states  and  control, 

llerrorx1  IL  =  |M*i)  -  arf  (^»)||00,  i  =  0, 1, . .  - ,  N, 
llerro^lL  =  ||*2(ti)  -  arf  («i)||oc,  *  =  0, 1, . . . ,  N, 

llerroru|loc  =  IK<») |L>  *  = 

respectively,  for  the  LPM  (Legendre  PS  method.  Section  3.2),  GOCM-S  (Chapter  4), 
GOCM-W  and  GOCM-FLGR  (Section  5.5.2. 1)  vs.  polynomial  order,  N.  Note  that  the 
GOCM-S  approximations  show  nearly  the  exact  same  exponential  convergence  rates  as  the 
Legendre  PS  method  numerical  solutions  throughout  the  displayed  range  of  N  values. 
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Figure  L7:  State  Xj  approximation  error  vs.  polynomial  order,  N,  for  Example  7.1. 

However,  the  GOCM-W  and  GOCM-FLGR  numerical  solutions  of  x2  and  u  show  a  marked 
improvement  over  that  of  the  GOCM-S.  Observations  from  this  example  show  that  the 
GOCM-W  and  GOCM-FLGR  formulations  have  the  potential  to  provide  increased  accura¬ 
cies  of  the  control  solution  with  lower  order  approximations. 
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Figure  18:  State  x2  approximation  error  vs.  polynomial  order.  N,  for  Example  7. 1 . 


Figure  19:  Control  u  approximation  error  vs.  polynomial  order,  N,  for  Example  7.1. 


163 


7.2.  Example  7.2:  Nonlinear  Two-Dimensional  Problem  with  Fixed 
Boundary  Conditions 


Consider  the  nonlinear  two-dimensional  problem  with  fixed  boundary  conditions 
given  by  Kang  [6]  of  minimizing  the  cost  function 


/*7T 

J  =  /  (1  —  Xi  +  X\X2  +  X\ u)2dt,  (7.2) 

Jo 


subject  to  the  dynamics 

1 

X\ (t)  =  —x\x2  and  T2(£)  =  —  1  H - 

x-i 

and  with  boundary  conditions 

£i(0)  =  1,  z2(0)  =  0,  X1(w)  =  — *— - 


+  0:2  +  sin  t  +  u, 


and  x2(tt)  =  2. 


The  analytic  solution  to  this  problem  is  given  by 


xi  (t) 
(t) 
u(t ) 


1 

1  —  sin  t  +  tJ 
1  —  cos  t, 

—  (£  +  !)+  sin  t  +  cos  t, 


obtained  via  Pontryagin’s  maximum  principle.  This  problem  was  solved  using  the  GOCM- 
W  (Section  5.1)  with  optimality  and  feasibility  tolerances  of  10”8  and  10-8,  respectively. 
A  two-point  initial  guess  was  provided.  Figure  20  shows  a  comparison  of  the  exact  solution 
with  the  GOCM-W  approximation  and  =  20. 
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Figure  20:  Exact  solution  and  GOCM-W  approximation  with  N  =  20  for  Example  7.2. 
Figures  2 1 ,  22  and  23  show  the  maximum  error  of  the  states  and  control, 

llerror*1  Hoc  =  ||*ife)  -  xi  (*i)IL.  *  =0,  i, . . . , jv, 

l|enorxa||00  =  ||a?2(*i)-*ir(*i)||ao,  *  =  0, 1, . . . ,  N, 

llerrorJoc  =  IK<») IL>  *  = 

respectively,  for  the  LPM  (Legendre  PS  method.  Section  3.2),  GOCM-S  (Chapter  4), 
GOCM-W  and  GOCM-OI  (Section  5.5.1)  vs.  polynomial  order,  N.  Note  that  the  GOCM- 
W  approximation  represents  a  formulation  for  which  the  initial  conditions  are  enforced 
weakly  and  the  final  conditions  are  imposed  in  a  strong  sense  through  an  endpoint  con¬ 
straint.  Although  enforcing  all  boundary  conditions  weakly  provides  accurate  solutions, 
a  partial  weak  enforcement  of  end  conditions  produces  higher  accuracies.  Also  note  that 
the  GOCM-OI  approximations  represent  the  over  integration  of  this  GOCM-W  formula¬ 
tion.  The  GOCM-S  approximations  show  nearly  the  exact  same  exponential  convergence 
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rates  as  the  Legendre  PS  method  numerical  solutions  throughout  the  displayed  range  of  N 
values. 


Figure  21:  State  a'i  approximation  error  vs.  polynomial  order.  N,  for  Example  7.2. 

The  GOCM-W  numerical  solutions  of  x2  and  u  show  a  marked  improvement  over  that 
of  the  GOCM-S  for  N  <  22;  the  control  error  for  the  GOCM-W  is  up  to  two  orders  of 
magnitude  lower  than  that  of  the  GOCM-S  for  the  N  values  in  this  range.  Finally,  the 
GOCM-OI  formulation  has  a  smoothing  effect  on  the  accuracies  of  the  GOCM-W  approxi¬ 
mations.  While  the  GOCM-OI  approximations  of  x2  appear  to  be  less  superior  to  that  of  the 
GOCM-W,  the  GOCM-OI  approximations  of  u  gain  slightly  in  accuracy  for  the  lower  order 
approximations.  Observations  from  this  example  show  that  the  GOCM-W  and  GOCM-OI 
formulations  have  the  potential  to  provide  increased  accuracies  of  the  control  solution  with 
lower  order  approximations. 
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Figure  22:  State  x2  approximation  error  vs.  polynomial  order.  N,  for  Example  7.2. 


Figure  23:  Control  u  approximation  error  vs.  polynomial  order,  N,  for  Example  7.2. 
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7.3.  Example  7.3:  Two-Dimensional  Bang-Bang  Control  Problem  with 
Fixed  Boundary  Conditions 

Consider  the  2-dimensional  bang-bang  control  problem  with  fixed  initial  conditions 
given  by  Pinch  [21]  of  controlling  the  system  from  the  initial  point  (a,  b)  at  t  —  0  to  the 
origin  (0,  0)  in  the  shortest  time.  Here,  we  minimizing  the  cost  function 

rff 

J  —  /  dt  =  tf, 

Jo 

subject  to  the  dynamics 


x(t)  =  y(t)  and  y(t)  =  u(t), 
and  with  boundary  conditions 

x(0)  =  a,  2/(0)  =  b,  x(tf )  =  0,  y(tf)  =  0, 

and  control  constraint  \u(t)\  <  k.  For  the  case  where  a  =  1,  b  =  3  and  k  —  1,  the  analytic 
solution  to  this  problem  is  given  by 


x(t)  = 


y(t)  = 


u(t )  = 


—  V  +  3t  +  1,  t  < 


V-  tft+i ,  t>t£, 
—t  +  3,  t  <  t$, 

1 

t  t  J  ^  t  ^ 

-1,  t<tz, 

1,  t  > 


obtained  via  Pontryagin’s  maximum  principle,  where  ^  =  3  +  y  y  and  t j  =  3  +  2  y  y- 
This  problem  was  solved  using  the  GOCM-S  (Chapter  4),  GOCM-W  (Section  5.1),  GOCM- 
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CG  (Section  5.2)  and  GOCM-DG  (Section  5.3).  A  two-point  initial  guess  was  provided  for 
each  approximation. 

Figures  24  and  25  show  comparisons  of  the  exact  solution  with  the  GOCM-S  and 
GOCM-W  numerical  solutions,  respectively,  with  approximation  order,  N  =  20. 


Figure  24:  Exact  solution  and  GOCM-S  approximation  with  N  =  20  for  Example  7.3. 

From  observations,  we  can  see  a  slight  improvement  from  the  GOCM-S  to  the  GOCM- 
W  numerical  solutions,  particularly  in  the  approximation  of  the  control,  u ,  in  the  vicinity 
of  the  discontinuity  location,  t^.  However,  the  GOCM-S  and  GOCM-W  have  difficulty 
in  approximating  the  discontinuous  control  solution.  Even  with  an  increased  approxima¬ 
tion  order,  there  remains  a  maximum  error  of  O(10_1)  for  the  GOCM-S  and  GOCM-W 
approximations  of  the  control. 
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Figure  25:  Exact  solution  and  GOCM-W  approximation  with  N  —  20  for  Example  7.3. 


Figures  26  and  27  show  comparisons  of  the  exact  solution  with  the  GOCM-CG  and 
GOCM-DG  approximations,  respectively,  with  number  of  nonuniform  elements,  Ne  =  2, 
polynomial  order  inside  each  element,  N  =  10,  and  the  boundary  of  the  two  elements 
located  at  t$.  It  should  be  noted  that  in  the  element  based  formulations,  is  defined  as 
a  decision  variable  in  the  NLP.  An  initial  guess  for  is  then  provided  to  the  NLP  with 
bounds  prescribed.  The  total  number  of  points  for  the  GOCM-CG  approximation  is,  Np  = 
(NeN  +  1)  =  21  while  for  the  GOCM-DG  approximation,  Np  =  ( N  -I-  1  )Ne  =  22. 
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Figure  26:  Exact  solution  and  GOCM-CG  approximation  with  Np  =  21,  Ne  =  2  and  the 
boundary  of  the  elements  located  at  for  Example  7.3. 


Note  that  both  the  GOCM-CG  and  GOCM-DG  approximations  of  the  states,  x  and  y, 
achieve  maximum  errors  of  O(10-9)  from  the  exact  states.  Additionally,  the  GOCM-DG 
achieves  an  impressive  O(10-8)  maximum  error  from  the  exact  control,  u,  as  compared 
with  an  accuracy  of  O(10_1)  for  the  GOCM-CG. 
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Figure  27:  Exact  solution  and  GOCM-DG  approximation  with  Np  =  22,  Ne  =  2  and  the 
boundary  of  the  elements  located  at  for  Example  7.3. 


7.4.  Example  7.4:  Two-Dimensional  Problem  with  Fixed  Boundary 
Conditions 


Consider  again  the  linear  two-dimensional  problem  with  fixed  boundary  conditions 
given  in  Example  3. 1  of  minimizing  the  cost  function 


J  = 


(7.3) 


subject  to  the  dynamics 


ii(t)  =  X2  and  x2 (f)  =  Csin(kt)  -I-  «, 


and  with  boundary  conditions 


#1(0)  =0,  £2(0)  =  0,  Xi(tf)  =  1  and  x2(tf)  =  0. 
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The  analytic  solution  to  this  problem  is  given  by  Equations  (3.9)— (3. 1 3),  where  £/  =  10, 
C  =  0.1  and  k  =  8.  This  problem  was  solved  using  the  GOCM-W  (Section  5.1)  with 
optimality  and  feasibility  tolerances  of  10~8  and  10~8.  respectively.  A  two-point  initial 
guess  was  provided  for  each  approximation. 


Figure  28:  Exact  solution  and  GOCM-W  approximation  with  N  =  50  for  Example  7.4. 


llerror^IL  =  |M^)-zf(*i)|L  *  =  AT, 

llerrorx2|loo  =  IMti)-^(<i)||00,  *  =  0,1,...,  N, 

l|enortt||00  =  ||u(««)  —  «JV(^)||00»  *  =  0,1,...,  JV, 

respectively,  for  the  LPM  (Legendre  PS  method.  Section  3.2),  GOCM-S  (Chapter  4), 
GOCM-W  and  GOCM-LG  (Section  5. 5. 2. 2)  vs.  polynomial  order,  N.  Note  that,  as  with 
Example  7.2,  the  GOCM-W  approximation  represents  a  formulation  for  which  the  ini¬ 
tial  conditions  are  enforced  weakly  and  the  final  conditions  are  imposed  in  a  strong  sense 
through  an  endpoint  constraint.  Although  enforcing  all  boundary  conditions  weakly  pro- 
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vides  accurate  solutions,  a  partial  weak  enforcement  of  end  conditions  produces  higher 
accuracies.  Note  that  the  GOCM-S  approximations  show  nearly  the  exact  same  exponen¬ 
tial  convergence  rates  as  the  LPM  numerical  solutions  throughout  the  displayed  range  of 
N  values. 


Figure  29:  State  X\  approximation  error  vs.  polynomial  order,  N,  for  Example  7.4. 
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Figure  30:  State  x2  approximation  error  vs.  polynomial  order.  N,  for  Example  7.4. 


Figure  31:  Control  u  approximation  error  vs.  polynomial  order,  N,  for  Example  7.4. 
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However,  the  GOCM-W  and  GOCM-LG  numerical  solutions  for  u  show  a  marked  im¬ 
provement  over  that  of  the  GOCM-S,  particularly  in  the  lower  order  approximations;  the 
control  error  for  the  GOCM-LG  is  one  to  five  orders  of  magnitude  lower  than  that  of  the 
GOCM-S  in  this  range.  Observations  from  this  example  show  that  the  GOCM-W  and 
GOCM-LG  formulations  have  the  potential  to  provide  increased  accuracies  of  the  control 
solution  with  lower  order  approximations. 

7.5.  Example  7.5:  Two-Dimensional  Multi-scale  Problem  with  Fixed 
Boundary  Conditions 

Consider  again  the  linear  two-dimensional  problem  with  fixed  boundary  conditions 
given  in  Example  3.1  and  7.4.  Here  we  take  advantage  of  the  multi-scale  nature  of  the 
problem.  The  slow  state,  X\,  fast  state,  x2  and  control  u  are  discretized  on  LGL  grids, 
{Tj}^=\ q,  {tj }Jo  and  {Tj}f= 0>  respectively,  where  NX1,NU  <  NX2.  This  problem  was 
solved  with  optimality  and  feasibility  tolerances  of  5  x  10-4  and  5  x  Hr3,  respectively. 
The  exact  solution  was  used  as  an  initial  guess.  Figure  32  shows  a  comparison  of  the  exact 
solution  with  the  GOCM-MS  (Section  5.4)  numerical  solutions  with  NX1  =  10,  NX2  =  50 
and  Nu  =  5. 
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Figure  32:  Exact  solution  and  GOCM-MS  approximation  with  NXI  =  10,  NX2  =  50  and 
Nu  =  5  for  Example  7.5. 

Unlike  the  multi-scale  Legendre  PS  method  outlined  in  Section  3.4,  the  GOCM-MS  pro¬ 
vides  a  feasible  solution.  In  fact,  the  maximum  error  of  the  GOCM-MS  numerical  solutions 
all  have  magnitude  O(10~4).  It  is  also  important  to  point  out  that  the  required  feasibility 
tolerance  of  5  x  10-3  is  the  same  order  of  magnitude  as  the  largest  spectral  coefficient 
dropped  from  the  approximation  (see  Figure  13).  This  observation  is  consistent  with  Re¬ 
mark  3.6.  Additionally,  Remark  3.6  highlights  the  potential  for  a  feasible  solution  with  the 
lower  bounds:  3  <  NX1,  43  <  NX2  and  1  <  Nu  (note  the  associated  magnitudes  of  the 
Legendre  spectral  coefficients  in  Figure  13  for  x\,  x 2  and  u ).  This  is  in  fact  the  case:  Fig¬ 
ure  33  shows  a  comparison  of  the  exact  solution  with  the  GOCM-MS  numerical  solutions 
with  NXI  =  3,  NX2  =  43  and  Nu  —  1,  solved  with  optimality  and  feasibility  tolerances  of 
5  x  10-4  and  5  x  10-3,  respectively. 
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Figure  33:  Exact  solution  and  GOCM-MS  approximation  with  NX1  =  3,  NX2  =  43  and 
Nu  =  1  for  Example  7.5. 

The  maximum  errors  of  the  new  GOCM-MS  approximation  (with  NX1  =  3,  NX2  =  43  and 
Nu  =  1)  remain  at  O(10~4)  for  the  control,  u,  and  O(10-3)  for  the  states,  x\  and  x%  Note 
that  when  the  full-scale  approach  was  used  in  Example  7.4,  a  GOCM-W  approximation 
order  of  N  >  40  was  required  for  the  same  order  of  accuracies.  It  is  clear  that  the  size  of 
the  NLP  for  this  problem  has  been  reduced  significantly.  The  number  of  decision  variables 
needed  for  the  multi-scale  Galerkin  optimal  control  formulation  is  nearly  33%  of  that  re¬ 
quired  of  the  full-scale  problem  solved  with  the  GOCM-W.  Additionally,  this  multi-scale 
approach  has  the  potential  to  more  efficiently  solve  a  great  number  of  optimal  control  prob¬ 
lems.  The  larger  the  dimension  of  the  multi-scale  problem,  the  greater  the  potential  savings 
in  computational  efficiency! 
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7.6.  Example  7.6:  Nonlinear  Two-Dimensional  Multi-scale  Problem 
with  Fixed  Initial  Conditions 


Consider  the  following  nonlinear  two-dimensional  problem  with  fixed  initial  con¬ 
ditions  given  by  Gong  et  al.  [71]  of  minimizing  the  cost  function 


J 


(xi  —  t)2dt, 


(7.4) 


subject  to  the  dynamics 


Xi(t)  =  sin  (50a;i )  +  x2  and  x2 (t)  =  u, 


(7.5) 


and  with  initial  conditions 


Xi(0)  =  0  and  x2(0)  =  1.  (7.6) 

The  analytic  solution  to  this  problem  is  given  by 


xi  (t)  =  t, 

x2{t)  =  1  —  sin  (50t), 
u(t)  =  — 50cos  (50f), 

obtained  via  Pontryagin’s  maximum  principle.  Figures  34  and  35  show  a  comparison  of 
the  exact  state  and  control  solutions  with  the  GOCM-W  (Section  5.1)  numerical  solutions, 
respectively,  with  N  =  45. 
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Figure  34:  Exact  state  solutions  and  GOCM-W  approximation  with  N  =  45  for  Exam 
pie  7.6. 


Figure  35:  Exact  control  solution  and  GOCM-W  approximation  with  N  =  45  for  Exam 
pie  7.6. 
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It  is  deal'  from  Figures  34  and  35  that  problem  (7.4)— (7.6)  may  be  considered  multi¬ 
scale,  where  X\  is  the  slow  state  and  x2  the  fast  state.  Consider  this  problem  now  recast  in 
the  form  of  Problem  B.  Here,  we  discretize  the  slow  state,  x\,  fast  state.  x2,  and  control,  u . 
on  LGL  grids,  {fj}J=o  anc'  {^j}j=0’  respectively.  In  order  to  determine  NXl,  NX2 

and  Nu  we  consider  the  spectral  coefficients  of  Xi,  x2  and  u  given  in  Figure  36. 


(a)  Legendre  spectral  coefficients  for  state  xj. 


0  5  10  15  20  25  30  35  40  45  50 


n 

(b)  Legendre  spectral  coefficients  for  state  x2. 


n 

(c)  Legendre  spectral  coefficients  for  control  u. 

Figure  36:  Legendre  spectral  coefficients  for  x\,  x2  and  u  for  Example  7.6. 
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If  the  feasibility  tolerance  is  set  to  10~3,  Remark  3.6  highlights  the  potential  for  a 
feasible  solution  with  the  bounds:  1  <  NXI,  39  <  NX2  and  42  <  Nu.  This  is  in  fact  the 
case:  Figures  37  and  38  show  comparisons  of  the  exact  states  and  control  with  the  GOCM- 
MS  (Section  5.4)  numerical  solutions  with  NX1  =  1,  NX2  =  39  and  Nu  =  42.  solved  with 
optimality  and  feasibility  tolerances  of  10-4  and  10-3,  respectively. 


Figure  37:  Exact  state  solutions  and  GOCM-MS  approximation  with  NXl  =  1  and  NX2  = 
39  for  Example  7.6. 
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Figure  38:  Exact  control  solution  and  GOCM-MS  approximation  with  Nu  =  42  for  Exam¬ 
ple  7.6. 


The  maximum  errors  of  the  new  GOCM-MS  approximation  (with  NXI  =  1,  NX2  =  39  and 
Nu  =  42)  are  O(10-3)  for  the  control,  u,  and  O(10~6)  and  O(10~4)  for  states,  xx  and  x2, 
respectively. 

7.7.  Summary 

Galerkin  optimal  control  is  a  versatile  family  of  numerical  formulations  for  solving 
optimal  control  problems.  The  examples  shown  in  this  chapter  demonstrate  the  potential 
of  this  Galerkin-based  family  of  formulations  outlined  in  Chapters  4-6.  Three  particular 
highlights  of  Galerkin  optimal  control  is  its  ability  to  weakly  enforce  problem  end  condi¬ 
tions,  handle  problems  with  discontinuous  solutions  and  its  potential  to  reduce  the  size  of 
multi-scale  problems. 

Examples  7.1,  7.2  and  7.4  demonstrate  the  improved  accuracy  provided  by  the 
Galerkin  optimal  control  formulation  with  weak  enforcement  of  boundary  conditions  over 
strong  enforcement  for  both  boundary  value  problems  as  well  as  problems  with  incomplete 
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sets  of  end  conditions.  In  particular,  the  GOCM-W  shows  an  advantage  over  the  GOCM- 
S  for  low  order  approximations  of  control  solutions.  Examples  7.1  and  7.4  also  show 
the  potential  advantages  of  the  Galerkin  formulations  with  F-LGR  and  LG  points,  respec¬ 
tively.  Example  7.3  demonstrates  the  effectiveness  of  the  element-based  Galerkin  optimal 
control  formulations  (such  as  the  GOCM-DG)  when  employed  to  approximated  optimal 
control  problems  with  discontinuous  controls.  Lastly,  Examples  7.5  and  7.6  demonstrate 
the  computational  efficiency  in  which  the  multi-scale  Galerkin  optimal  control  formula¬ 
tion  may  solve  multi-scale  problems,  those  with  states  and  controls  that  evolve  on  different 
timescales.  In  contrast  to  the  difficulties  with  the  multi-scale  Legendre  PS  method  (see 
Section  3.4)  highlighted  in  Chapter  3,  the  GOCM-MS  is  shown  to  successfully  reduce  the 
size  of  multi-scale  problems. 
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CHAPTER  8: 

CONCLUSIONS  AND  AREAS  FOR  FUTURE  RESEARCH 


8.1.  Dissertation  Summary 

This  dissertation  introduced  and  developed  the  theory  for  a  Galerkin-based  family 
of  numerical  formulations  that  calculate  optimal  trajectories  by  discretizing  the  system 
dynamics  using  Galerkin  numerical  techniques  and  approximate  the  cost  function  with 
Gaussian  quadrature.  An  important  result  of  the  Galerkin  formulations  are  that  they  can  be 
used  to  prove  feasibility  and  consistency  theorems  that  apply  to  optimal  control  problems 
with  continuous  and/or  piecewise  continuous  controls.  It  was  shown  that  Galerkin  optimal 
control  may  be  formulated  in  a  variety  of  ways  to  allow  for  efficiency  and/or  improved 
accuracy  while  solving  a  wide  range  of  optimal  control  problems. 

A  highlight  of  Galerkin  optimal  control  is  its  ability  to  be  formulated  to  enforce 
boundary  conditions  in  a  weak  sense,  imposing  end  conditions  only  up  to  the  accuracy 
of  the  numerical  approximation  itself.  The  increased  approximation  accuracy  of  the  weak 
boundary  formulation  (particularly  in  the  approximation  of  control  solutions)  was  shown  on 
several  linear  and  nonlinear  problems.  It  was  also  demonstrated  that  the  Galerkin  optimal 
control  formulation  with  weak  imposition  of  end  conditions  allows  for  problem  discretiza¬ 
tions  with  other  than  LGL  points.  Galerkin  optimal  control  with  Legendre-Gauss-Radau 
and  Legendre-Gauss  points  were  shown  to  be  advantageous  due  to  the  increased  accuracy 
of  solutions.  Galerkin  optimal  control  may  also  be  formulated  with  other  than  Lagrangian 
test  functions,  such  as  the  Legendre  polynomials. 

In  addition,  Galerkin  optimal  control  has  proven  to  be  effective  in  reducing  the 
dimension  of  multi-scale  problems,  those  in  which  states  and  controls  evolve  on  different 
timescales.  In  one  example  presented  the  number  of  decision  variables  required  by  the 
multi-scale  Galerkin  optimal  control  formulation  was  nearly  33%  of  that  required  of  the 
full-scale  problem.  The  multi-scale  formulation  has  the  potential  to  more  efficiently  solve 
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a  great  number  of  optimal  control  problems,  certainly  those  that  include  fast  and  slow 
dynamics. 

For  optimal  control  problems  with  discontinuities  (such  as  bang-bang  control  prob¬ 
lems),  an  element-based  approach  has  shown  to  be  beneficial.  In  general,  using  the  element- 
based  Galerkin  optimal  control  formations  discussed  (both  continuous  and  discontinuous 
Galerkin)  may  lead  to  higher  computational  efficiencies  and  the  formulation  may  be  re¬ 
tooled  to  incorporate  hp-adaptive  techniques,  such  as  the  spectral  algorithm  discussed  in 
[86].  Additionally,  the  discontinuous  Galerkin  formulation  may  be  advantageous  from  a 
parallel  computing  standpoint. 

Galerkin  optimal  control  has  demonstrated  exponential  convergence  for  a  large 
class  of  problems.  It  is  clear  that  Galerkin  optimal  control  is  a  versatile  and  accurate  family 
of  formulations  that  has  the  potential  to  provide  real  time  optimal  control  solutions  for  a 
number  of  applications. 

8.2.  Future  Work 

Galerkin  optimal  control  shows  the  potential  for  solving  a  wide  range  of  optimal 
control  problems  with  a  variety  of  state  and  control  constraints.  However,  application  of 
the  Galerkin  optimal  control  formulations  to  real-world  problems  have  been  somewhat 
limited  due  to  research  time  limitations.  Future  application  of  Galerkin  optimal  control  to 
problems  with  real-world  conditions  is  necessary  to  demonstrate  its  true  versitiliy.  Testing 
Galerkin  optimal  control  on  different  types  of  control  problems  will  inevitably  highlight  the 
strengths  (and  weaknesses)  of  each  formulation.  Lastly,  this  dissertation  includes  a  number 
of  important  theorems  that  serve  as  the  theoretical  foundations  for  Galerkin  optimal  control, 
however,  the  list  is  not  complete.  Future  work  to  increase  the  theoretical  underpinnings  of 
Galerkin  optimal  control,  to  include  a  rate  of  convergence  analysis,  is  forthcoming. 
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APPENDIX  A: 

NORMS  AND  FUNCTIONAL  SPACES 


Throughout  this  dissertation,  the  Lv  spaces,  for  1  <  p  <  oo,  are  used  quite  often. 
They  consist  of  all  measurable  functions  v  :  [— 1, 1]  — >  R,  such  that  [87] 


\v(t)\pdt  <  oo , 


'-i 


with  finite  ZT-norm, 


\V\\lv  = 


'-1 


\v(t)\pdt  <  oo 


In  particular,  the  L2  space  is  referenced  quite  readily,  defined  by 


•  i 


I^IIl2  =  l  /  Ht) |  dt  )  , 


which  is  induced  by  the  inner  product 

(u,v)  —  j  u(t)v(t)dt. 

Additionally,  the  L°°  space  consist  of  all  measurable  functions  v  :  [—1, 1]  — > 
such  that  \v(t)\  <  M  for  almost  all  t  e  [—1, 1].  The  L°°-norm  can  be  expressed  as 

|M|loo  =  inf{M||u(f)|  <  M  almost  everywhere  on  t  G  [—1, 1]}. 


The  idea  of  a  function  of  bounded  variation  is  also  used  within  this  dissertation.  To 
frame  this  idea  first  we  must  define  the  total  variation,  V  (u).  For  a  function  u  :  [—1,1]  — »  M 
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the  total  variation  of  u  on  [—1, 1]  is  defined  as 

N 

V(u)  =  supj^Kij)  -  u{tj- 1)|  {tk}k= o  e  P}, 

3= 1 

where  P  is  the  set  of  all  finite  partitions  of  [-1,1].  If  the  total  variation  is  bounded,  V  (■ u )  < 
oo,  then  u  is  said  to  be  of  bounded  variation  in  [—1, 1], 
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APPENDIX  B: 
DISCRETE  NORMS 


Let  grid  {tj}f=0  be  associated  with  quadrature  weights  {wj}jL0  (see  Section  2.2.2). 
Then  the  discrete  norm,  Ill'll  jv,  from  [46]  is  defined  as  the  quantity 

1 

IMIat  = 


where  the  discrete  inner  product  is  accomplished  by  numerical  integration  and  given  by 


N 

( v,v)n  =  vitjfwj . 

3=0 


In  the  case  that  v2  G  P2at+<5  and  {tj}jL0  are  LG,  LGR  or  LGL  points,  the  numerical 
integration  is  exact  and 


(v,v)N 


u2dt , 


where  5  =  1,0,  —1  for  LG,  LGR  or  LGL  points,  respectively,  and  {wj}jL0  are  the  associ¬ 
ated  weights. 

Additionally,  there  exist  constants  a,  (3  >  0  such  that 


« \\v\\L2  <  |MliV  <  P\\v\\L2 


for  all  v  e  PN- 

Lastly,  11^11^  denotes  the  maximum  element  of  vector,  v  e  Wl. 
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APPENDIX  C: 
SOBOLEV  SPACES 


Throughout  this  dissertation,  the  Sobolev  spaces,  Wm,p ,  are  referenced  quite  often. 
They  consist  of  all  functions,  v  :  [—1, 1]  — *  M"  having  weak  derivative,  e  Lp,  where 
0  <  i  <  m,  with  the  norm  [87] 


1 


where  \\v\\LP  denotes, 


lp  ~ 


\v(t)\pdt 


i 


1 

P 


The  seminorm  may  be  expressed  as 


v 


w m,p-,N 


1 


Additionally,  Sobolev  spaces  may  be  defined  with  a  fractional  order.  They  consist  of  all 
measurable  functions  v  :  [—1, 1]  — *  M,  such  that  [88] 


w°*  =  (.>£!':  /  1  f  |l’(r)  .  T^f-dxdy  <  oo}, 


1-1  J  —1 


\x  -  y I 


for  0  <  a  <  1  and  1  <  p  <  oo,  with  the  norm, 


I 


\v{t)\pdt+  /  j  . 

-i  7-i  7-i  \x  —  y\ 


Lastly,  due  to  the  extensive  use  of  the  space  Wm'2,  notation  is  simplified  by  letting  If  "'"  '2  = 


Hrn. 
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